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Passion is just the start 



L ast month, 65 Nobel laureates gathered in 
Lindau, Germany, for an annual exchange of 
knowledge between these most honored scientists 
and a group of selected young researchers from 
around the globe. To extend the reach of the 
conference. Science hosted a webinar to address a 
common concern of young scientists everywhere: 
persevering in science. Some advice on this topic may 
have been unexpected. 

Elizabeth Blackburn and 
Jack Szostak (both 2009 
laureates in Physiology or 
Medicine, for the discovery 
of how chromosomes are 
protected by telomeres) and 
Daniel Shechtman (2011 lau- 
reate in Chemistry, for the 
discovery of quasi-periodic 
crystals) fielded questions 
submitted from more than 
3100 young scientists who 
registered for the discussion. 

Not lost on these Nobelists 
was the discouragement 
that young scientists feel 
about their career pros- 
pects. Many find themselves 
in serial postdocs for many 
years while fielding rounds 
of rejection letters for aca- 
demic positions. In the face 
of this disheartening situa- 
tion, there was some tough 
advice. Given the current competitive climate for hiring 
and advancement, the panel agreed that young scientists 
who are not deeply passionate about their research need 
to reconsider their career choices. And even if one has 
such extraordinary passion, it alone is not enough. The 
panelists stressed the importance of becoming a lead- 
ing expert in something new and in demand. They also 
emphasized the need to hone people skills, as science is 
more and more a team effort, and to become able com- 
municators in order to share goals and achievements 
with potential funders and the public. Graduate students 
also need to be smart in selecting a lab for postdoctoral 
training; ideal attributes are a collaborative atmosphere, 
opportunities for leadership in publications, and expo- 
sure to more than just one senior leader in the field who 
will be able to write a letter of recommendation. 

Certainly, the path to a successful academic career, no 
less to a Nobel Prize, is not easy. The panelists also faced 



obstacles, periods of self-doubt, and career setbacks. But 
if a young scientist faces despair from too many failed 
academic job searches, one panelist had some tough ad- 
vice: It’s time to reassess options as to whether academia 
is the right track. The panelists encouraged thoughtful 
consideration of industry and government research labs, 
because in many cases, they offer more stable funding, 
and an impressive number of Nobel laureates came from 
these environments. 

A large part of persever- 
ing in science is balancing a 
demanding scientific career 
with personal life. Panelists 
emphasized the importance 
of flexibility; adjusting 
one’s scientific career to 
meet the needs of a fam- 
ily for example. I certainly 
resonated with this advice. 
A seagoing oceanographer 
early in my career, I decided 
to move into a different po- 
sition that allowed me to 
stay ashore when my three 
daughters reached the age 
when having their mother 
home regularly really mat- 
tered. Once they all were 
done with college, I was 
open to job opportunities 
for which more time away 
from home was required. 
For young scientists in 
particularly competitive fields, they are challenged with 
doing everything they can to stand out from the crowd. 
In this regard, I am delighted by the growing number 
of events that are devoted to helping young scientists 
network with very influential leaders who might help 
advance their careers (such as the Lindau Nobel Laure- 
ate Meeting) and prizes that recognize the early-career 
accomplishments of young scientists (such as the Sci- 
LifeLab Prize and the Eppendorf Prize, both of which 
are administered in conjunction with Science). Advisors 
and mentors may not have the time to look out for such 
resume-enhancing possibilities for their trainees. I en- 
courage ambitious young scientists to investigate these 
opportunities. 

Many Nobel Prize winners spoke about their “good 
luck,” but on closer inspection, in most cases, they had 
created the conditions for their own good luck to happen. 

- Marcia McNutt 




“...if a young scientist faces 
despair...It’s time to reassess 
options...” 




Marcia McNutt 
Editor-in- Chief 
Science Journals 



10.1126/science.aac9472 
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I’ve secretly been working on a lander. 55 

New Horizons principal investigator Alan Stern, at a press conference 
14 July as the spacecraft made its closest approach to Pluto. 

Stern was joking with reporters about plans to return to the dwarf planet. 




Plants in cold storage 




National Museum of Natural History curator Vicki Funk and a summer field team member collect 
samples from the U.S. Botanic Garden. 



I n the race to preserve biodiversity even as species extinction rates 
soar, Smithsonian Institution scientists are starting with their 
own backyards. Plants from the U.S. Botanic Garden, the U.S. 
National Arboretum, and the Smithsonian Gardens— all located in 
Washington, D.C.— are the first targets of an initiative to capture 
the genomic diversity of half of the world’s living plant genera in 
less than 2 years. On 8 July, the Smithsonian’s National Museum of 
Natural History launched the effort, which will include a summer field 
team of students who will assist scientists with the museum’s Global 
Genome Initiative in sampling plants from the gardens’ holdings. The 
scientists will then preserve them in cryogenic vials and store them in 
liquid nitrogen. GGI, the Norway-based Svalbard Global Seed Vault, and 
the Royal Botanic Gardens’ Millennium Seed Bank in the United States 
are three of the largest ex situ conservation efforts to preserve genetic 
material outside of its natural environment. Ex situ conservation is a 
strategy described in the Intergovernmental Panel on Climate Change’s 
2014 Fifth Assessment Report as an insurance policy against climate 
change and other potential sources of biodiversity loss. 



AROUND THE WORLD 

“Cures” bill clears U.S. House 

WASHINGTON, D.c. | A bill to Speed 
the discovery and development of new 
medical treatments sailed through the 
U.S. House of Representatives last week. 
The legislation, known as the 21st Century 
Cures Act, calls for a temporary “innova- 
tion fund” that would give the National 
Institutes of Health an extra $8.75 billion 
over 5 years. It also aims to smooth the 
approval of high-priority treatments; for 
example, one provision allows the Food 
and Drug Administration to approve anti- 
biotics for rare life-threatening diseases 
based on smaller clinical trials. Although 
the bill enjoys the backing of hundreds 
of industry, research, and patient orga- 
nizations, it has also prompted concerns 
that the effort to speed cures might 
unintentionally compromise patients’ 
safety. The updated version of the bill will 
now head to the Senate, which has yet to 
release its own biomedical innovation bill. 
http://scim.ag/housecures 

Helium rules stifle competition 

WASHINGTON, D.c. | Ncw ruIcs on Selling 
off a U.S. government cache of helium- 
essential for many types of technology 
and scientific research— aren’t working as 
planned, witnesses said last week before 
the Committee on Natural Resources in the 
U.S. House of Representatives. The 2013 
Helium Stewardship Act aimed to establish 
a competitive market for federal helium by 
phasing in an auction instead of having the 
U.S. Bureau of Land Management (BLM) 
sell it for a fixed price. But the law may 
be stifling competition, the panel said, by 
giving an unfair advantage to companies 
that have storage contracts with BLM to 
receive the crude helium and refine it. 
“Nonrefiners” who buy federal helium must 
also work out a “tolling” deal to get one of 
the refiners to process it. BLM previously 
reserved 10% of helium for fixed-price 
“nonallocated” sales to nonrefiners, but at 
BLM’s first helium auction in July 2014, 
two refiners bought it all, shutting the nine 
nonrefiners out of the bidding. 
http://scim.ag/heliumrules 
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Infrared light 
(bottom) reveals the 
spots of the black 
leopard (top). 




Seeing (black leopard) spots 



S cientists keep track of the population density of leopards in the wild using camera 
traps: they snap pictures of the cats and identify individuals by their character- 
istic spot patterns. But, due to a recessive gene that causes melanism, most of 
the leopards prowling the Malay Peninsula are black— even their spots— making 
it nearly impossible to distinguish them using the cameras. Now, there’s a work- 
around, scientists report in the Journal of Wildlife Management. The spots are visible 
at infrared wavelengths, so by modifying infrared flash camera traps on the peninsula 
so that they were forced into night mode throughout the day, the researchers were ulti- 
mately able to distinguish distinct spot patterns and identify 94% of the animals along 
a wildlife corridor in Malaysia— crucial for keeping tabs on the region’s population over 
time. The work not only represents the first leopard density estimate in Malaysia, the 
authors note, but also the first successful attempt to estimate population size using 
melanistic phenotypes— such as a black leopard’s dark spots. 



Appeals court dodges H5N1 issue 

AMSTERDAM | Influenza researcher Ron 
Fouchier of Erasmus MC in Rotterdam, 
the Netherlands, received a disappointing 
verdict in his legal battle over so-called gain- 
of-function research, in which flu strains 
are made more transmissible in the lab. 

In 2012, the Dutch government demanded 
that Fouchier seek an export license before 
submitting a highly controversial H5N1 
flu study to Science. Fouchier obtained the 
license under protest, and Erasmus MC 
sued the government for infringement of 
academic freedom— a claim a district court 
rejected in 2013 {Science, 8 November 2013, 
p. 676). The case went to Amsterdam’s Court 
of Appeal, which annulled the lower court’s 
decision last week, Fouchier says, because 
Erasmus MC had no standing to sue once it 
had obtained the license. The appeals court 
did not rule on the key issue of whether 
Fouchier ’s research is subject to export con- 
trol. The verdict was not yet public as Science 
went to press. http://scim.ag/FouchierH5Nl 

Deal struck on Iran nukes 

VIENNA I Iran this week agreed to disman- 
tle large pieces of its nuclear program in 
exchange for relief from crippling economic 
sanctions— with science as an important 
beneficiary. The agreement, 2 years in 
the making, would slow Iran’s “breakout 
time”— the time needed to produce enough 
weapons-grade fissile material for one 
bomb— from an estimated 2 to 3 months to 
at least a year. Iran will convert its sensitive 
Fordow uranium enrichment facility into 
an international “nuclear, physics, and 
technology centre,” with Russia helping to 
reconfigure two centrifuge cascades to pro- 
duce stable isotopes for industry. Fordow 
will also host small linear accelerators for 
basic research in nuclear physics and astro- 
physics. Proposals for collaborative projects 
at Fordow will be reviewed at an interna- 
tional workshop. The deal also calls for the 
possibility of having Iran participate in the 
International Thermonuclear Experimental 
Reactor, or ITER, under construction in 
France. http://scim.ag/lranFordow 



NEWSMAKERS 

Plant biologist altered images 

Two investigations found many instances 
of image manipulation in papers by Olivier 
Voinnet, a plant biologist at CNRS, the 
French national research agency, who is cur- 
rently working at the Swiss Federal Institute 
of Technology Zurich (ETH). Voinnet’s 
work came under attack in December on 



a website called PubPeer; a CNRS inves- 
tigation that won’t be made public found 
manipulations that constituted scientiflc 
misconduct and “tarnished” CNRS’s reputa- 
tion, the agency said on 10 July. CNRS 
suspended Voinnet for 2 years. On the 
same day, ETH announced that a panel had 
found problems in 20 of Voinnet’s papers, 
but that those produced while he was at 
ETH did not constitute misconduct under 
the institute’s deflnition. Voinnet received 
an “admonition” but will keep his position 
at ETH. In a statement, he apologized and 
took “complete responsibility.” 



FINDINGS 

Cholera vaccine shows promise 

An inexpensive cholera vaccine may help 
prevent the deadly disease in the more 
than 50 countries, mostly in Asia and 
Africa, where it is endemic, according to 
a new study in The Lancet. Cholera causes 
severe diarrhea and is estimated to kill 
about 95,000 people— a large number of 
whom are children— every year. In the 
flrst-of-its-kind study, scientists tested 
the vaccine, called Shanchol and made in 
India, in 270,000 people in the slums of 
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Dhaka, where the threat of contracting the 
infection is nearly constant. They found 
that the vaccine provided individuals more 
than 50% protection against cholera and 
lowered life-threatening episodes of the 
infection by about 40%. When given as 
part of routine health services, the vaccine, 
which consists of a two-dose regimen that 
currently costs $3.70, could drive down the 
toll considerably, the researchers say. 

A flightless, winged dine 

Researchers have found the largest ever 
dinosaur with full-fledged wings and 
feathers, they report online this week in 
Scientific Reports. The 125 million-year- 
old dino, called Zhenyuanlong suni and 
found in China, was about 1.65 meters 
long, a little longer than a modern condor. 
It belonged to a group of dinos called 
dromaeosaurs, which includes Velociraptor 
from Jurassic Park, and which was 
closely related to early birds. But although 
Zhenyuanlon^s wings had multiple layers 
of birdlike feathers, they were very short 
compared with those of most other winged 



BY THE NUMBERS 

15 

million 

Number of people receiving 
antiretrovirals for HIV in March 2015, 
according to a new UNAIDS report 
released this week. 

834,000 

Number of rabbits, nonhuman pri- 
mates, and other regulated animals 
used in U.S. biomedical research last 
year— a drop of 6% from the previous 
year, and the lowest number since 
data collection began in 1972. 

18.7% 

Increase in global wildfire activity 
over the last 35 years, according to a 
study in Nature Climate Change. 

220 17 JULY 2015 • VOL 349 ISSUE 6245 




The short-armed, winged, and feathered dinosaur Zhenyuan/ong sun/. 



dinos, and thus it was probably unable 
to fly. That leaves the discovery team 
wondering what the wings were for. One 
possibility is that it evolved from ancestors 
that could fly but used its wings mainly for 
sexual display, http://scim.ag/flightlessdino 

Enter the pentaquark 

GENEVA, SWITZERLAND | A long scarch 
paid off this week, when researchers 
at the European high-energy physics 



laboratory CERN announced 
that they had sighted strong 
evidence of particles made 
up of five quarks. Predicted 
by physicist Murray Gell- 
Mann in 1964 to exist 
alongside protons and 
neutrons (which contain 
three quarks) and mesons 
(which are quark-antiquark 
pairs), pentaquarks would 
consist of four quarks and an 
antiquark. Scientists thought 
they had discovered a light 
pentaquark known as theta- 
plus in 2003, but those claims evaporated 
under further scrutiny (Science, 22 April 
2005, p. 478). Physicists at the Large 
Hadron Collider’s LHCb experiment 
instead snared a heavier “charmonium” 
pentaquark by sifting through thousands 
of decays of three-quark lambda-b par- 
ticles. The finding, which still has to be 
confirmed by other groups, suggests the 
existence of a variety of pentaquarks 
with different masses, the scientists say. 
http://scim.ag/pentaquark 




Scientists (top) display a wafer made of 7-nanometer chips (inset). 



IBM back on track with Moore’s law 



BM announced last week that it has manufactured working versions of a new, ultra- 
dense computer chip that will reduce the area needed for a given amount of circuitry 
by half and has four times the capacity of the most powerful chips on the market. 
Currently, commercial chip technology is at the “14-nanometer manufacturing” stage, 
referring to the size of the chip’s smallest features. The new IBM chips have 7-nano- 
meter transistors, making it possible to pack more into the same space and increasing 
the capacity. The new chips were manufactured by using silicon-germanium in some 
regions rather than pure silicon, which the com- 
pany says reduces power requirements 
and makes faster transistor switching 
possible. But IBM hasn’t said when the new 
chips are likely to be commercially available. 
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IN DEPTH 



4.2Q07j)rotest denouncing 
I'wateijoarding as torture; 

several psychologists 
■guided the method’s used 
on U.S. military detainees. 



SCIENTIFIC SOCIETIES 

Torture report prompts APA apology 

Admitting it colluded with U.S., psychologists group to change policies, leadership 



By John Bohannon 

» fter years of denying that it had given 
scientific and ethical legitimacy to 
torture by the U.S. government, the 
American Psychological Association 
(APA) last week accepted the find- 
ing of an external investigation 
that concluded it had done just that. Now, 
with a public apology and sudden wave 
of high-level resignations or retirements, 
APA is struggling to craft an institutional 
response that will satisfy its members and 
long-time detractors, even as some of those 
pilloried in the probe defend themselves 
and their colleagues. 

“This is a crisis,” says Nadine Kaslow, 
a psychologist at Emory University in At- 
^ lanta and a former APA president, who 
I helped launch the investigation. “I regret 
o that the organization didn’t listen to the 
“ critics earlier.” 

I The 542-page report from a former Chi- 
I cago inspector general, David Hoffman, 
o pulls no punches, concluding that APA 
I officials colluded with the U.S. govern- 
g ment to enable the torture of detainees. 

o 

I APA’s Board of Directors quickly released 
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a response, promising among other things 
to recommend a new policy prohibiting 
psychologists from participating in interro- 
gation of persons held in custody by mili- 
tary and intelligence authorities. APA then 
announced the departure of most of its 
staff leadership: CEO Norman Anderson, 
Deputy CEO Michael Honaker, public rela- 
tions director Rhea Farberman, and ethics 
director Stephen Behnke. 

“All of these people were in the know,” 
says former APA President Gerald Koocher, 
a psychologist at DePaul University in Chi- 
cago, Illinois, and the editor of the jour- 
nal Ethics 8c Behavior. Koocher himself 
is under fire, as two of APA’s staunchest 
critics have called for the body to censure 
him as well, after receiving a confidential 
briefing on the Hoffman report earlier this 
summer. 

APA had commissioned the report last 
year after the publication of Pay Any 
Price: Greed, Power and Endless War, a 
book by New York Times reporter James 
Risen that accused the organization of 
providing cover for torture. The report’s 
most damning findings concern a 2005 
APA committee called the Task Force on 



Psychological Ethics and National Secu- 
rity (PENS). The task force was created 
in the midst of revelations that detainees 
were subjected to “enhanced interroga- 
tion” at U.S. government facilities in Iraq, 
Afghanistan, and Guantanamo Bay, Cuba, 
and that psychologists were intimately in- 
volved in both the design and practice of 
these efforts. 

As Hoffman discovered through inter- 
views, medics within the intelligence com- 
munity were “not on board” with such 
interrogations. To quell this internal re- 
sistance, the government hoped to enlist 
support from APA, psychology’s largest pro- 
fessional organization. And the PENS task 
force provided it, concluding in a 2005 state- 
ment that it was ethical for psychologists to 
take part in the interrogation program. 

The PENS decision sparked protests by 
many APA members, some of whom called 
for withholding dues, but Hoffman found 
that they were ignored. “Being involved 
in the intentional harming of detainees ... 
could do lasting damage to the integrity 
and reputation of psychology, a profession 
that purports to ‘do no harm,”’ he writes, 
but “these countervailing concerns were 
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CLINICAL TRIALS 

Researchers seek clear reasons 
when clinical trials end early 

Explanations for abandoning tests of new treatments 
earlier than planned are often hazy 



simply not considered or were highly sub- 
ordinated to APA’s strategic goals.” Accord- 
ing to Hoffman, APA sought to maintain its 
privileged relationship with the Pentagon, 
a massive employer of psychologists. 

Hoffman’s analysis of internal APA 
emails found that the members of the 
PENS task force were carefully chosen in 
a collaboration involving officials at APA, 
the Pentagon, and the Central Intelligence 
Agency, and its conclusions were vetted in 
advance by insiders at both agencies. The 
goal of PENS, Hoffman offers, was not to 
examine the ethics of torture but to “curry 
favor” with the U.S. Defense Department. 

Hoffman’s characterization of PENS is 
unfair, according to Koocher, who was 
one of the architects of the task force. “We 
solicited widely and openly for member- 
ship,” he says. The fact that so many task 
force members came from the military is 
not evidence of collusion but good judg- 
ment. “If you’re focusing on interroga- 
tion in a military context then those are 
the people with the relevant expertise.” As 
for the allegation of currying favor with 
the Pentagon, Koocher is adamant that it 
was not his goal. “No way were we cover- 
ing up for [Vice President] Cheney or [De- 
fense Secretary] Rumsfeld, both of whom I 
cannot stand.” 

Koocher says that he was unaware that 
the torture was ongoing. He points out that 
he, along with other representatives of U.S. 
medical associations, visited the detention 
center at Guantanamo in 2006. “I asked 
hard questions,” he says. When it was later 
revealed that torture continued at the facil- 
ity, “I was extremely upset.” But by then, he 
says, “I was no longer an APA official. What 
was I supposed to do?” 

That sentiment may not save Koocher 
from sanctions. He is on a list of APA mem- 
bers to be banned from APA governance 
“effective immediately”— just one of several 
recommendations from Steven Reisner of 
New York University and Stephen Soldz 
of the Boston Graduate School of Psycho- 
analysis, who also urged that APA’s top ex- 
ecutive, legal, and public relations staff be 
fired. Reisner and Soldz, persistent critics 
of APA’s role in the interrogation program, 
were invited by APA to review the Hoffman 
report in advance and give the society their 
feedback. APA wouldn’t comment specifi- 
cally on the pair’s recommendations; sev- 
eral people on their “staff to be fired” list 
remain with APA. “A lot of change can hap- 
pen, but it will take a lot of time to imple- 
ment it,” Kaslow says. 

APA’s 180° turn is only a start, Soldz says. 
“The APA and the entire psychology profes- 
sion needs to grapple with the enormous 
scandal enveloping psychological ethics.” ■ 



By Jennifer Couzin-Frankel 

L ike a marathon with far more run- 
ners limbering up at the start than 
stumbling across the finish, the race 
to bring a new treatment to market 
has dropouts along the way. About 
12% of clinical trials are reported 
to shut down prematurely. Knowing why 
could help minimize the number of termi- 
nated trials going forward. 

But a paper published earlier this month 
by a group of computational biologists 
suggests that this knowledge isn’t easy 
to come by. The main reason: Companies 
can t}q)e whatever they want into the 
tiny space— a mere 160 characters— allot- 
ted on ClinicalTrials.gov, a registry main- 
tained by the National 
Library of Medicine. 

In their analysis of all 
3122 terminated trials 
on the registry at the 
time their study began, 

“it just seemed like a 
complete mess,” says 
Frederick Roth, a systems biologist and 
geneticist at the University of Toronto in 
Canada. Reasons were often murky, rang- 
ing from “it was decided to not proceed 
with the study at this time” to “SARS epi- 
demic in Asia and Canada.” 

While at Harvard University, Roth and 
two undergraduates came to their project 
quite by accident. They were interested in 
adverse interactions between drugs or be- 
tween drugs and genes. They decided to do 
what he calls an “amateur pass” through 
ClinicalTrials.gov. 

Their search didn’t yield much. The rea- 
sons for termination were so diverse and 
often so vague that Roth and his students 
decided to launch a new project: learn 
more about why clinical trials end early by 
dividing the information given in the short 
blurbs into “buckets,” such as funding, eth- 
ical reasons, or business decisions, so they 
could see the breakdown by category. They 
found that by far the most popular reason 
was insufficient enrollment, accounting for 
about one-third of terminated trials. About 
11% failed to establish efficacy. In all, Roth 



and his students identified 35 categories, 
among them “lost interest,” “inadequate 
design,” and “key staff left.” (In one case, 
an orthopedic surgeon performing a trial’s 
knee surgeries moved elsewhere.) “It’s a 
potpourri of reasons why you terminate,” 
agrees Deborah Zarin, the director of 
CIinicaITriaIs.gov. 

In the new paper, published online 
at bioRxiv.org, Roth and first author 
Theodore Pak, now an M.D./Ph.D. student 
at the Icahn School of Medicine at Mount 
Sinai in New York City, recommend that 
CIinicaITriaIs.gov aim for greater transpar- 
ency by asking sponsors to answer several 
questions when their trials end early. Those 
include whether it even started, whether 
data were ever examined, and whether in- 
terim examinations of 
efficacy or safety played 
a role. 

Zarin welcomes Roth’s 
dive into the myriad ex- 
planations, but she is 
cautious about the ques- 
tions the authors recom- 
mend for trial sponsors. Safety and efficacy 
can’t always be easily separated from each 
other, she says. Zarin is especially interested 
in distinguishing between trials that end be- 
cause of the science versus some other rea- 
son: In May, she and her colleagues reported 
in PLOS ONE that 68% of 905 terminated 
trials with results listed on CIinicaITriaIs.gov 
stopped for reasons other than data accumu- 
lated in the study. Only one-fifth ended early 
because of safety or efficacy concerns from 
trial data, and the rest didn’t give a reason. 

Another unsettling issue is that many 
terminated studies aren’t listed as such 
in CIinicaITriaIs.gov and other databases 
because the entries are not up to date, ac- 
cording to ongoing research by clinical 
epidemiologist Matthias Briel at the Uni- 
versity Hospital of Basel in Switzerland. 
That, combined with sometimes squishy 
reasons offered for trial termination, sug- 
gests to him that one implication of Roth’s 
paper is that “people might then take infor- 
mation from these registries and introduce 
inaccuracies at least, or even bias into their 
own studies, without realizing it.” ■ 



^It just seemed like a 
complete mess.” 

Frederick Roth, University of Toronto 
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The World Health Assembly rejected a proposal to increase membership fees in May. 



GLOBAL HEALTH 

Report prescribes strong medicine for WHO 

Ebola failures show that difficult reforms are needed, an independent panel says 



By Kai Kupferschmidt 

T he World Health Organization (WHO) 
is still struggling to end the Ebola out- 
break in West Africa. But in the wake 
of what many viewed as a sluggish re- 
sponse to the crisis, another battle is 
brewing— about the future role of the 
organization itself. 

On 7 July, an independent panel delivered 
a scathing review of WHO’s performance and 
proposed wide-ranging reforms that would 
enable the agency to better tackle the next 
major health crisis— from giving it more 
money and power to setting up a special, 
semi-independent emergency center. WHO 
“does not currently possess the capacity or 
organizational culture to deliver a full emer- 
gency public health response,” concluded 
the group, led by Dame Barbara Stocking, 
the former chief executive of Oxfam in the 
United Kingdom; it recommended that the 
agency be made “fit for purpose.” 

But although many of the suggestions 
have been praised as sensible, WHO’s com- 
plex, politicized governance structure and 
entrenched bureaucracy make the $2 billion 
U.N. agency difficult to change. Just 2 months 
ago, for instance, the World Health Assembly 
(WHA)— the annual meeting of health minis- 
t ters that is WHO’s decision-making body— re- 
^ jected a proposal by WHO Director-General 
I Margaret Chan to increase member states’ 
> contribution fees by 5%, as the report rec- 
I ommends. Several other suggestions in the 
I report were previously made by a panel that 
I reviewed the HlNl pandemic of 2009, but 
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were never implemented. 

The new report, too, “might just die on 
paper,” says Joanne Liu, president of Doctors 
Without Borders (MSF), which has played 
a major role in the Ebola epidemic. “Every- 
thing at this stage is in the hands of the mem- 
ber states and how willing they are to give a 
second chance to WHO.” 

The six-member panel, appointed by 
WHO’s Executive Board in March, inter- 
viewed WHO sources and outside experts, 
met with representatives of relief organiza- 
tions, and flew to the affected countries. It 
explored why WHO waited until 8 August 
2014, when Ebola had already infected more 
than 1000 people, to declare the outbreak a 
Public Health Emergency of International 
Concern, a status that triggers international 
action. Early warnings about the outbreak’s 
scale within WHO were ignored, the panel 
found, partly because “WHO does not have 
an organizational culture that supports open 
and critical dialogue between senior leaders 
and staff or that permits risk-taking.” 

In addition, local community leaders— 
whose collaboration would prove crucial to 
controlling Ebola— were engaged too late, 
deployments of staff and consultants in the 
field were too short, and communication 
with the media was ineffective. 

The report’s most far-reaching proposal 
is that WHO set up a new Centre for Health 
Emergency Preparedness and Response that 
would bring together WHO experts in out- 
break control and those providing humani- 
tarian assistance, two areas that are now 
separate. The center should be overseen by an 



independent board and headed by “a strong 
leader and strategic thinker,” the report 
notes. It’s a good proposal, says Norwegian 
epidemiologist Preben Aavitsland, the head 
of a researching and consulting company 
in Kristiansand, Norway. But WHO officials 
may resist the idea, because a strong emer- 
gency center could become a state within a 
state, he cautions. “It will be interesting to 
see whether that is followed up.” 

Others say WHO does not need an internal 
capacity to mount a full-on response to every 
outbreak. Rather than expanding, the organi- 
zation should depend more on external part- 
ners like the US. Centers for Disease Control 
and Prevention— which has three times 
WHO’s annual budget— or Public Health 
England, says David He 3 miann, a former 
WHO assistant director-general who is now 
at the London School of Hygiene & Tropical 
Medicine. “To be sustainable, the workforce 
should be outside of WHO,” he says. “I would 
be totally against WHO increasing its staff to 
sit around and wait for outbreaks to occur.” 
The report also stresses the need to boost 
countries’ compliance with the International 
Health Regulations (IHR), which lay out the 
responsibilities of WHO and member states 
during outbreaks. Completely revised and 
modernized after the SARS outbreak of 2003, 
the current IHR require member states to 
have “core public health capacities”— such 
as laboratories and trained staff— to detect 
and fight infectious diseases. So far, only 
64 of 193 member states have confirmed that 
they comply with these rules; WHO should 
develop a plan to bring all countries up to 
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Russia targets Western ties 

Crackdown on “foreign agents” and “undesirable” groups 
threatens private support for science 



speed, the report says. 

That is easier said than done in dirt-poor 
countries where even basic medical services 
are lacking, Liu says. “It’s this long, huge 
shopping list of things that you are sup- 
posed to implement, with no prioritization,” 
she says. WHO should narrow the require- 
ments down to the essentials, Liu argues. 
Aavitsland adds that developed countries 
should help foot the bill. 

In addition, the report says, WHO should 
penalize countries that violate an IHR pro- 
vision banning member states from issu- 
ing travel restrictions if WHO doesn’t deem 
them warranted. More than 40 countries 
imposed unnecessary restrictions during the 
Ebola outbreak, which hurt the economies in 
Guinea, Sierra Leone, and Liberia, and made 
it harder to send people and supplies. The re- 
port recommends giving WHO the power to 
sanction such behavior and to take cases to 
the U.N. Security Council. But that requires 
countries to surrender some national powers, 
something politicians are notoriously reluc- 
tant to do. “The current IHR were the result 
of a political negotiation that took much 
time,” Heymann says. “In order to change 
them, there have to be new negotiations.” 

A spokesperson for WHO says “several 
of the recommendations in the report have 
been addressed already or are being ad- 
dressed.” For instance, the organization 
is looking seriously at establishing a new 
emergency center, he says. Whether any real 
change is possible will become clear at the 
next WHA in May 2016, Aavitsland says. The 
issue of WHO’s budget will surely come up 
again then as well. “Member states keep in- 
creasing their expectations of what WHO will 
deliver, in many areas, without giving the or- 
ganization the means to fulfill these expecta- 
tions,” he says. “I think member states will 
get what they pay for.” 

In 2017, Chan, who has been criticized for 
being too conflict-averse, is stepping down; 
a new, more forceful personality at the helm 
could make a huge difference, says Ilona 
Kickbusch, an independent global health 
consultant based in Brienz, Switzerland, 
and one of the authors of the new report. A 
group looking at proposals to reform WHO’s 
governance has discussed limiting directors- 
general to one 7-year term instead of two 
5-year terms, she says. Free of the need to be 
reelected, WHO leaders might find it easier 
to challenge member states. 

Liu is hopeful that a stronger WHO will 
emerge from the soul-searching. “The reality 
is today, there is only one organization that 
has the legitimacy and the authority to call 
a Public Health Emergency of International 
Concern, and that is not MSF, or the World 
Bank. It is WHO,” she says. “So they need to 
have the capacity to do that.” ■ 
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By Vladimir Pokrovsky 

T he news last week that the Dynasty 
Foundation, Russia’s only private 
funder of scientific research, is clos- 
ing its doors adds to a darkening 
prospect for philanthropic support of 
Russian science. The decision by the 
D 5 masty Foundation’s council, announced 
8 July in a terse one-sentence notice on 
Djmasty’s Russian-language website, came 
weeks after the Russian government had 
labeled Dynasty a “foreign agent” under a 
recently enacted law. 



That move was part of what many see as a 
growing official crackdown on organizations 
the government considers subversive. A new 
mechanism, separate from the foreign-agent 
law, threatens to label such groups as “un- 
desirable” and make collaborating with 
them illegal, potentially curtailing their sup- 
port for scientists. “These two stories are 
very S5miptomatic and replicate each other,” 
says Mikhail Gelfand of the Russian Acad- 
emy of Sciences’ (RAS’) Institute for Infor- 
mation Transmission Problems in Moscow, 
one of Russia’s top biologists. 

In May, the Russian Ministry of Justice 
added Djmasty to its list of foreign agents— a 
new designation for organizations that re- 
ceive funding from the West {Science, 5 June, 
p. 1067)— on the grounds that Dynasty’s 
founder, telecommunications mogul Dmitry 



Zimin, funds the foundation from personal 
accounts held in Western banks. Zimin was 
infuriated and left Russia in early June. 
D 5 masty’s governing council decided to try to 
keep operating without personal donations 
from Zimin, but it declared the foreign agent 
label “absolutely unacceptable.” The final 
straw came in mid- June, when a court fined 
Dynasty 300,000 rubles (about $5500) for 
refusing to register as a foreign agent. 

Anna Piotrovskaya, the foundation’s ex- 
ecutive director, told the press that she can- 
not say exactly when it will close but that all 
of Dynasty’s obligations to current grantees 



will be fulfilled. Vladimir Putin’s spokesman 
Dmitry Peskov said the Kremlin regrets that 
the foundation will be liquidated. But he 
said no one forced the closure. 

Some Russian scientists, however, called 
the move outrageous. Zimin’s public humili- 
ation was an “especially dishonorable ac- | 
tion,” says Evgeny Onishchenko, a researcher ^ 
at RAS’ Lebedev Physical Institute in Mos- | 
cow. “He was unselfishly helping to develop ^ 
and popularize science in Russia, but the au- % 
thorities have . . . publicly made a spy of him.” | 

As Dynasty’s board deliberated, Russian 5 
authorities were enacting even more draco- | 
nian measures. A new law, which was passed < 
in May and took effect in June, authorizes | 
the Russian prosecutor general to label “un- | 
desirable” any group deemed to threaten the | 
foundations of constitutional order, national I 
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defense, or national security. Organizations 
so designated must cease activities in Russia 
and will have their bank accounts blocked. 
Groups or individuals who cooperate with 
them, even outside Russia, face fines and 
could be sentenced to as many as 6 years in 
prison for a second offense. 

On 8 July, the Federation Council— the 
upper chamber of Russia’s parliament- 
published a suggested list of 12 undesirable 
organizations for the prosecutor general to 
investigate. The list includes the MacArthur 
Foundation, which has run grant programs 
in Russia, and the Open Society Founda- 
tions of Hungarian-born philanthropist 
George Soros. Parliamentarians say many 
more names will follow. Press reports have 
suggested that they could include the Ford 
Foundation, the International Research & 
Exchanges Board (which awards grants for 
postgraduate study in the United States), 
and other donors. 

Seeing Soros’s name on the list was be- 
wildering, researchers say, because one of 
his charities— the International Science 
Foundation (ISF)— helped ensure the sur- 
vival of Russian science in the mid-1990s. 
During the economic turmoil that followed 
the collapse of the Soviet Union, ISF spent 
more than $100 million on assistance that 
included 35,000 emergency research grants 
in Russia and funding to build 32 university 
computer centers in Russia’s major cities. 
Soros’s foundations haven’t been active in 
Russia since 2003 and have shown no sign 
of plans to return. 

“Soros has done very much” for Russia, 
says Pavel Arseniev, who directed ISF’s Rus- 
sian office in the mid-1990s. Arseniev notes 
that the government of former Prime Min- 
ister Viktor Chernomjrdin had partnered 
with ISF in funding travel grants. Gelfand 
says the Russian scientific community is 
partly to blame. “They did not react to ac- 
cusations that Soros allegedly was bujdng 
[Russian] secrets.” 

As for Dynasty, Arseniev faults its law- 
yers for giving the government an opening. 
“If you work in an aggressive environment, 
you cannot ignore such details,” he says. 
“Zimin, though he never positioned himself 
as a political figure, has always behaved very 
independently, which undoubtedly annoyed 
§ the conservative part of the establishment. 
^ Once they got a possibility, they took him 
° off the stage.” 

^ Onishchenko says the attack on D 5 masty 
I and the black list are troublingly reminis- 
5 cent of the Soviet era. “As in the former 
5 gloomy time, they create the image of an en- 
^ emy who is responsible for the majority of 
§ Russia’s problems,” he says. “And those who 
^ helped Russian science get persona non 
Q grata status.” ■ 



By Jeffrey Mervis 

According to the conventional wisdom, the 
cost of the things needed to do biomedical 
research in the United States rises faster 
than the price tags on all consumer goods 
and services. For 3 decades, an index cre- 
ated by the National Institutes of Health 
(NIH) has documented that disparity, giv- 
ing lobbyists more ammunition to plead 
with lawmakers that NIH’s annual budget 
hikes should exceed the overall rate of in- 
flation in the country. 

The NIH index, which captures the cost 
of such things as reagents, test animals, 
and scientific equipment, has at times 
outpaced that broader index by as many 
as three percentage points. But in 2012, 
a strange thing happened that challenges 
the conventional wisdom; 

The Biomedical Research 
and Development Price 
Index (BRDPI) fell below 
the Gross Domestic Prod- 
uct Price Index (GDP PI), a 
variation of the more famil- 
iar Consumer Price Index. 

The biomedical index’s 1.3% 
growth rate that year not 
only trailed the GDP Pi’s 
1.9% but also was the low- 
est in the BRDPI’s history. 

(The news, which NIH re- 
ported in January 2014 after the final 2012 
numbers had been crunched, went largely 
unnoticed at the time.) 

To find out why that year was such an 
anomaly, one needs to know what goes into 
the BRDPI. NIH told Science that the infor- 
mation wasn’t publicly available, but that it 
could be obtained under the federal Free- 
dom of Information Act (FOIA). We did so, 
and learned that the index is dominated not 
by the cost of equipment and supplies, but 
rather by the salaries and benefits paid from 
a grant. In fact, overall personnel costs typi- 
cally account for two-thirds of the change in 
the index from one year to the next. 

The outsized effect of salaries and ben- 
efits on biomedical inflation became clear 
after Congress passed a spending bill in 
December 2011 that lowered the salary 
ceiling for investigators on a standard NIH 
grant from $199,700 to $179,700. Legisla- 
tors wanted to free up money for more 
research grants at a time when scientists 
complained that NIH’s $30 billion bud- 
get couldn’t support enough of their good 
ideas. The 2011 bill was the most recent 



example of NIH’s chronic boom-and-bust 
funding cycles. A 2-year, $10 billion budget 
spike, part of a massive stimulus package 
to help the US. economy recover from the 
2008 global financial meltdown, was end- 
ing, and money was tight. 

The decision to cap salaries sent the 
BRDPI into a tailspin, bringing it below the 
already low GDP PI. In 2008 the biomedical 
index had hit a 20-year peak of 4.7%, more 
than twice the 2.1% rate for the GDP PI. By 
2010 it had fallen somewhat, to 3.0%, but it 
still far outpaced the minuscule 0.9% rate 
clocked by the GDP PI. Then it plummeted 
in 2012, whereas the GDP PI rose to 1.9%. 

Outsiders might have thought the drop 
would be good news. After all, if biomedi- 
cal inflation slows, then NIH 
should be able to stretch its 
limited dollars further. 

But top NIH officials did 
not trumpet the slowdown. 
The former head of NIH’s 
extramural program, Sally 
Rockey was in the habit of 
blogging each year about 
the value of the index. She 
has called it an “important 
[way] ... to measure changes 
to the purchasing power of 
the NIH dollar and make 
projections for future fiscal years.” But in 
a 28 March 2014 posting, she simply noted 
that the 2012 dip “was primarily driven by 
the reduction in the salary cap for princi- 
pal investigators.” 

Nor does a departmental memo an- 
nouncing the record-low 2012 BRDPI 
number include any formal reaction to the 
surprising slowdown. The same memo does 
include a preliminary estimate for the 2013 
BRDPI that shows it exceeding the broader 
index, and the authors appear relieved at 
the prospect of returning to the status quo. 
The 2013 BRDPI figure, they note, although 
“still low by historical standards ... is at 
least once again growing at a higher rate 
than the [GDP] price index.” 

NIH enjoys strong support in Congress, 
and the realization that biomedical infla- 
tion largely tracks salary trends, not the 
sticker price of essential lab equipment and 
supplies, is unlikely to have a major impact 
on policy debates. Still, it may behoove 
biomedical Iobb 5 dsts to think twice before 
citing the cost of high-tech science as a ra- 
tionale for pumping up NIH’s budget. ■ 




percent 

Share of the biomedical 
research price index taken 
up by salaries 
and fringe benefits. 
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MEANS TO AN END 

Cities, states, and provinces are gearing up to halt their AIDS 
epidemics— though the definition of success varies 

By Jon Cohen 



A fter what Shane Ryan calls “a silly, 
stupid weekend” in late April, he 
joined the many other men in San 
Francisco, California, who line up 
six mornings each week and wait 
for the doors to open at Magnet. 
Located in the city’s Castro dis- 
trict, Magnet provides sexual 
health services for gay and bisex- 
ual men. Last year, the center diagnosed 37% 
of the new HIV infections in the city. Ryan, 

226 17 JULY 2015 • VOL 349 ISSUE 6245 



like most of the other men in line, came to 
find out whether his condomless sex had the 
consequence he feared. “Usually, I’m very 
safe, always precautious, but I went a little 
sidetracked that weekend,” Ryan said. 

Ryan, who is 24 years old and grew up in 
Ireland, met with a nurse practitioner, who 
explained that he could immediately start 
taking antiretrovirals (ARVs). This so-called 
postexposure prophylaxis (PEP) might abort 
an infection if indeed he had been exposed to 



HIV, the nurse explained. Ryan began taking 
a pill each day that contains four ARVs. 

In June, when Ryan’s follow-up test came 
back negative, another Magnet nurse prac- 
titioner, Pierre-Cedric Crouch, asked if he 
wanted to start on pre-exposure prophylaxis, 
or PrEP. “Having gone on PEP is an indication | 
that PrEP might be a good thing for you to g 
do,” Crouch said. He explained that the daily g 
PrEP pill contains only two of the four drugs g 
used in PEP and has far fewer side effects. It I 
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After this test showed Shane Ryan’s blood was 
negative for the AIDS virus, Magnet’s nurse 
practitioner Pierre-Cedric Crouch (background) 
offered him anti-HIV drugs to prevent infection. 



is not a substitute for using condoms, he said, 
but it provides “that extra layer of protec- 
tion.” Ryan opted to try it and was directed 
to a PrEP benefits manager who would help 
him get the drug at low or no cost. 

Magnet’s aggressive attempt to prevent 
the spread of HIV to people at high risk is 
part of a groundbreaking citywide initiative 
launched this January that aims to make San 
Francisco the first jurisdiction in the world to 
get transmission down to such low levels that 
it effectively will “end” its epidemic. Specifi- 
cally, San Francisco wants to achieve a 90% 
drop in new infections and deaths by 2020. 
^ “I think we’re going to get to the point where 
< HIV diagnoses are very rare,” says Steve 

0 Gibson, who heads Magnet. 

1 New York state and the Canadian prov- 

0 ince of British Columbia (BC) are also at the 

1 vanguard of the ending AIDS movement. 
^ These three locales have different defini- 
Q tions of what it means to end AIDS, as well 
^ as different approaches, tailor-made to their 
I distinct demographics and politics. All, 
I however, rely on aggressive use of ARVs and 
^ intensive efforts to identify newly infected 
Q people and their partners. Those strategies 



reflect several game-changing findings of 
the past 5 years. 

Researchers now know that if infected 
people take their ARVS and knock down 
the virus to undetectable levels in the blood, 
they rarely transmit it to their sexual part- 
ners. The approach, called treatment as 
prevention, pays special dividends with 
recently infected people, who account for a 
disproportionate amount of spread. Studies 
have also shown that uninfected people who 
take the drugs pre-exposure— PrEP— can 
greatly reduce their own risk of infection. 

Buoyed by these remarkable advances 
against a virus that to date has infected 
76 million people and shortened the lives of 
half of them, the Joint United Nations Pro- 
gramme on HIV/AIDS (UNAIDS) in 2014 set 
an “ambitious treatment target” of 90-90-90 
by 2020: 90% of infected people worldwide 
will know their HIV status, 90% of those will 
receive ARVs without interruption, and 90% 
of those will have no detectable virus in their 
blood. Doing the math, that means 73% of 
the total infected population will have fully 
suppressed viral levels. This treatment-as- 
prevention strategy, even without PrEP, will 
“end the AIDS epidemic as a major global 
health threat by 2030,” UNAIDS predicts. 

A just-released report, “Defeating AIDS— 
advancing global health,” in the 25 June on- 
line issue of The Lancet, is less optimistic. 
“There is an urgent need to do more and 
to do better now,” declares the report, by a 
commission including heads of state, pub- 
lic health leaders, and even actress Charlize 
Theron. It notes that too many locales have 
sluggish responses and little hope of down- 
grading their epidemics to “low-level ende- 
micit}^’ (see sidebar, p. 230). But BC, New 
York, and San Francisco aim to pave the way 
in the ending AIDS movement, which will be 
a central topic at an international HIV/AIDS 
meeting in Vancouver 19 to 22 June. 

JULIO MONTANER, CO-CHAIR of the Van- 
couver meeting and the key architect of 
90-90-90, took a gamble when he and his 
colleagues published an editorial in the 
5 August 2006 issue of The Lancet that ar- 
gued for making ARVs the backbone of pre- 
vention efforts. “We were told by UNAIDS 
that we were medicalizing prevention, and 
that this was irresponsible,” says Montaner, 
who is at the BC Centre for Excellence in 
HIV/AIDS in Vancouver (an offshoot of 
the University of British Columbia). He ac- 
knowledges that he and his colleagues were 
relying more on mathematical models than 
hard evidence that treatment as prevention 
worked. But studies vindicated him. “I’m 
happy we were right.” 

Although BC has not declared an explicit 
goal for “ending AIDS,” Montaner says it is 
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committed to reaching the 90-90-90 target 
by 2020. Nearly seven times the size of New 
York state, the province has only 4.6 million 
residents, an estimated 12,000 of whom are 
infected with HIV, giving it a prevalence rate 
of 0.3%. The epidemic boomed in the early 
1990s in men who have sex with men (MSM). 
When heroin use surged around 1996 in 
downtown Vancouver, HIV infections did 
as well, which led authorities to declare a 
public health emergency the next year. The 
government expanded a long-running meth- 
adone maintenance program, distributed 
free needles, and in 2003 opened the first su- 
pervised injection site in North America. In 
2011, the province began offering ARVs to all 



Treatment cascade 

For the treatment-as-preventwion strategy 
to work, people must know their HIV 
status, receive appropriate medical care, 
and take ARVs regularly such that their 
HIV becomes undetectable. 



■ San Francisco United States 




Diagnosed Linked On ARVs, 

withFIIV to care Undetectable FIIV 



infected people, whereas most of the world, 
concerned about cost and side effects, still 
recommended withholding treatment until a 
person’s immune system was damaged. 

With the widespread use of ARVs, 
AIDS-related illness and death in BC has 
plummeted since 1996. New HIV diagnoses 
have fallen from a high of 681 per year in 
1995 to 262 in 2014, with a huge drop among 
people who inject drugs. But among MSM 
and heterosexuals, the number of new di- 
agnoses has barely budged for several years. 
And although Vancouver has been relatively 
successful, the epidemic is poorly controlled 
in more remote regions of the province. 

In BC as a whole, some 80% of infected 
people know their HIV status, 61% take 
ARVs at some point, but only 51% have un- 
detectable viral levels— the cornerstone of 
the treatment-as-prevention strategy. “Treat- 
ment as prevention has not been pushed to 
the limit,” Montaner says. So the BC center 
and its collaborators have recently stepped 
up efforts to find infected people and make 
sure everyone who starts ARVs stays on 
them so the virus is fully suppressed. 
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Following a surge in HIV infections from sharing needles, Vancouver opened Insite in 2003 for drug users to inject safely. 



As part of its rewed-up effort, BC is one of 
a growing number of places that is exploiting 
DNA sequencing of the fast-evolving virus to 
work out how isolates are related. That al- 
lows epidemiologists to pinpoint hot spots of 
transmission, where interventions can then 
be targeted {Science, 12 June, p. 1188). These 
data have led health authorities to concen- 
trate outreach efforts in the northern, rural 
part of the province, which is home to many 
indigenous Canadians. 

Some criticize EC’s program for not going 
beyond 90-90-90 and offering PrEP to high- 
risk but uninfected people like Shane Ryan, a 
mainstay of efforts in New York and San Fran- 
cisco. “I don’t think we’re going to eliminate 
the epidemic without PrEP,” 
says Susan Buchbinder, who 
directs HIV prevention at the 
Department of Public Health 
in San Francisco. Montaner, 
who notes that Canada’s gov- 
ernment-funded health care 
plan does not reimburse for 
PrEP, is not convinced PrEP 
is needed. But he says he is 
keeping an open mind. If in 
5 years the jurisdictions that 
combine both strategies are 
ahead of us, he says, “I’ll say 
PrEP is the path to take.” 

MARK HARRINGTON and 

Charles King hatched their 
bold plan for ending the 
AIDS epidemic in New York 
in the back of a paddy 
wagon. In 2012, during an in- 
ternational AIDS conference 



in Washington, D.C., the prominent activ- 
ists were arrested outside the White House 
while protesting the Obama administra- 
tion’s HIV/AIDS policies. Based in New York, 
Harrington runs Treatment Action Group, 
an HIV/AIDS think tank, and King is head of 
Housing Works, which helps find homes for 
people living with the virus. Convinced that 
the administration’s plan was not aggressive 
enough, they began brainstorming while en 
route to their jail cell about how they could 
do better. 

With support from HIV/AIDS research- 
ers at Columbia University and the AIDS 
Institute at the New York State Department 
of Health, Harrington and King organized 



community meetings to explore how New 
York could drive down new infections to 
such a low level that the epidemic would 
peter out. “It brought more gravitas to the 
whole thing that it was researchers, advo- 
cates, and policymakers all working to- 
gether to push for this,” says psychologist 
Robert Remien, who heads Columbia’s HIV 
Center for Clinical and Behavioral Studies. 

The idea received an enormous political 
boost in June 2014 when New York Governor 
Andrew Cuomo, a Democrat, announced a 
three-point plan to end the state’s AIDS epi- 
demic by expanding testing, treatment, and 
access to PrER Afew months later, Cuomo set 
up an Ending the Epidemic (ETE) Task Force, 
which includes Harrington, 
King, other community ad- 
vocates, scientists, and gov- 
ernment health workers. In 
April 2015, at a ceremony to 
unveil the group’s blueprint, 
Cuomo said that when he 
announced the goal, people 
thought it was “outrageous.” 
But New York will suc- 
ceed and set an example, 
said Cuomo, who budgeted 
$10 million this year to sup- 
port the ETE project. 

The meaning of “ending 
AIDS” varies from place to 
place, and the “Defeating 
AIDS” report urges the inter- 
national AIDS community to 
agree on “a precise scientific 
and epidemiological defini- 
tion of low-level endemicity.” 
The report defines the end 




On Gay Pride Day in June 2014, New York Governor Andrew Cuomo (D), shown marching 
in a Manhattan parade, announced a plan to end AIDS in the state. 
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as a reproduction number below one, mean- 
ing that each infected person transmits the 
virus to less than one other person on aver- 
age. At that rate, the epidemic will gradually 
fade. No locale has adopted that exact lan- 
guage. Governor Cuomo offered this defini- 
tion: “The end of the AIDS epidemic in New 
York state will occur when the total number 
of new HIV infections has fallen below the 
number of HIV-related deaths.” In 2012, the 
state had an estimated 3000 new infections 
and 1653 AIDS deaths (with an ongoing de- 
bate about how many of those were HFV re- 
lated). The ETE blueprint aims to reduce the 
number of new infections to 
750 in 2020. 

New York has more HIV- 
infected residents, about 
half of whom are MSM, than 
any state. Out of a popula- 
tion of nearly 20 million, an 
estimated 154,000 people 
were living with HFV in 
2012. Of those, 132,000, or 
89%, knew their status, but 
only 68,000, or 44%, were 
on treatment and had un- 
detectable levels of virus, far 
below the UNAIDS target. 

To decrease the gap between 
diagnosis and effective treat- 
ment, which Harrington 
refers to as “the Grand Can- 
yon,” the blueprint calls for 
stepping up surveillance to 
find people who know their 
status but don’t consistently 
use ARVs and to assist them 
with nonmedical challenges 
that might get in the way 
of treatment, such as find- 
ing housing and jobs. It 
will also provide incentives 
to both care providers and 
patients to help people stay 
on treatment. 

Meanwhile, 22,000 New 
Yorkers don’t even know 
they are infected. To reach 
them, the state will ramp up 
routine testing in places like 
emergency rooms and send 
out more mobile testing units to high-risk 
populations, including MSM, transgender 
people, new immigrants, the homeless, and 
the mentally ill. ETE is launching a statewide 
education campaign about PrEP— which has 
been approved by the US. Food and Drug 
Administration for 3 years but has been slow 
to catch on, in part because some uninfected 
people worried that it would brand them as 
promiscuous and reckless. New York is also 
creating programs to help people access 
PrEP drugs. Medical record “autopsies” of 
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people who die from AIDS will try to identify 
how they slipped through the cracks. 

The state’s portion of the price tag for put- 
ting 68,000 more people on treatment by 
2020 is $2.25 billion, according to a recent 
analysis done by Treatment Action Group 
and Housing Works. Additional housing 
support for the 12,000 or so HIV-infected 
people who are homeless or poor would add 
as much as $720 million to the bill. But im- 
proved care and decreases in new infections 
would reduce the load on the health care 
system by enough to save New York nearly 
$8 billion, the analysis finds. 



ON WORLD AIDS DAY in 2013, Diane 
Havlir of the University of California, San 
Francisco (UCSF), moderated a forum for 
the local community entitled “Getting 
to Zero in San Francisco: How Close Are 
We?” After experts described their city- 
wide efforts, Havlir, a clinician who heads 
the HIV/AIDS division at San Francisco 
General Hospital, invited questions from 
the audience. 

“You guys are all doing great things,” a 
man said. “Are you working together?” 



Havlir was taken aback. “At that moment 
we recognized that we were really not work- 
ing together,” she said recently. That discus- 
sion sparked the Getting to Zero Coalition, 
which bands together UCSF, the city’s health 
department, HIV/AIDS advocacy groups, 
and major hospitals. In January, the coali- 
tion released a strategic plan for ending the 
city’s epidemic. 

Although San Francisco has just 20% the 
population of BC, it is home to more HIV- 
infected people— an estimated 15,901 at the 
end of 2013, an adult prevalence of about 
2%. One of the first cities to be hard-hit by 
HIV, San Francisco early 
on developed a strong re- 
sponse by combining the 
research and clinical re- 
sources of UCSF and the lo- 
cal health department. The 
city’s epidemic surveillance 
is sophisticated and granu- 
lar, including mapping of 
viral load levels by neigh- 
borhoods, which reveals 
where people are not con- 
trolling their infections. In 
2010, San Francisco became 
the first jurisdiction to rec- 
ommend immediate treat- 
ment for all HIV-infected 
people. Three years later, 
San Francisco General Hos- 
pital went a step further 
and launched a RAPID pro- 
gram that starts people on 
treatment the very day they 
are diagnosed, if possible. It 
also sends outreach teams 
to the streets to search for 
patients who miss appoint- 
ments {Science, 13 July 2012, 
p. 175). Researchers at UCSF 
and the health department 
pioneered PrEP studies. 

Exceptional as San Fran- 
cisco’s response has been, 
it also has serious short- 
comings, which the stra- 
tegic plan confronts with 
three “signature initiatives”: 
scale up RAPID citywide, 
teach people about PrEP and make it easy 
to get— a la Shane Ryan’s visit at Magnet— 
and work more diligently to keep people tak- 
ing their ARVs. Testing is a not a big issue, 
as some 94% of infected people in the city 
know their status. But whereas 89% of those 
people see a specialist within 90 days, only 
63% start on ARVs and achieve undetectable 
viral loads for prolonged periods. The Get- 
ting to Zero Coalition hopes to end AIDS by 
bringing that 63% up to 90% by 2020— an 
even more ambitious goal than the 73% set 
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New York City’s hot spots of spread 

With genetic sequences of HIV from different people, researchers can 
identify how the virus moves between individuals (circles). Mapping these 
transmission clusters can help target treatment and prevention efforts. 
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No end in sight 

By Jon Cohen, in Tijuana, Mexico 

O n 6 December 2013, Pedro Robles 
spent 14 hours in an ambulance 
being driven up Mexico’s Baja 
California Peninsula. The 51-year- 
old man was not rushed north for 
emergency medical care. Time was not of 
the essence. 

Robles had an advanced case of AIDS, 
and he was being driven 1127 kilometers 
north from his home in Loreto to 
Albergue Las Memorias A.C. in Tijuana. 

A nongovernmental organization (NGO) 
arranged the trip, because Las Memorias 
is the only AIDS hospice on the entire 
Baja California Peninsula, Robles was 
broke, and Tijuana held out the remote 
hope that someone there could navigate 



the medical bureaucracy and maybe save 
his life. But Las Memorias itself, which 
also serves as a drug rehabilitation center 
and is largely run by its residents, has 
no trained medical staff. And although 
Las Memorias did what it could to make 
Robles comfortable, Tijuana ultimately 
failed him: He died 6 days later without 
ever having seen a doctor. 

As the dream of ending AIDS catches 
hold in a growing number of locales (see 
main story, p. 226), Tijuana is hardly 
anomalous: Many places are still strug- 
gling to provide basic treatment and 
prevention services. Of course, people still 
die from AIDS in wealthy countries like 
the United States, which is visible from 
downtown Tijuana, but appropriate care 
is so readily available that AIDS hospices 
shut their doors years ago. 

Like Tijuana, too many locales appear to 
be “running at a standstill” and are saddled 



by “poor strategy, absence of leadership, 
or inadequate resources,” laments a promi- 
nent commission in a report in The Lancet 
last month, “Defeating AIDS-advancing 
global health.” 

The drop in HIV infections and AIDS- 
related deaths worldwide over the past 
dozen years has been impressive, the 
report says. But without a “massive and 
rapid expansion of a comprehensive AIDS 
response,” the global toll— still more than 
a million new infections and deaths each 
year— will worsen again over the next 
5 years, and the world will fail to reach 
the United Nations goal of “ending AIDS 
as a public health threat” by 2030. ^ 

Mexico is not particularly hard-hit by | 
HIV. In 2014, UNAIDS estimated the coun- ^ 
try had 190,000 infected people, which is § 
an adult prevalence of 0.2%— lower than | 

in the United States. The government o 

offers free antiretrovirals (ARVs) and, since i 
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November 2014, has recommended that 
all HIV-infected people receive them as 
soon as diagnosed. “When you look back 
to what we were doing 10 years ago, we 
are really, really better,” says Carlos Magis- 
Rodriguez of the National Center for HIV/ 
AIDS Prevention and Control in Mexico 
City. But he acknowledges that Tijuana 
and other cities in Mexico are struggling. 

Tijuana has what is known as a 
“micro-hyperepidemic.” Overall HIV 
prevalence in Tijuana is 0.6%— the same 
as the United States. But the rate is 
soaring in high-risk groups. In women 
who sell sex, prevalence jumped from 
2% in 2003 to 6% by 2012, according 
to recent studies by researchers from 
the University of California, San Diego 
(UCSD), whose team includes Mexico- 
based colleagues. Clients of these sex 
workers had a prevalence of 5%, they 
found. HIV prevalence is also about 
5% in the many people in Tijuana who 
inject drugs. Preliminary studies of men 
who have sex with men and transgender 
people suggest about 20% are infected. 

In theory, Tijuana should be able to 
rein in its concentrated epidemic by 
taking advantage of recent advances. 

Key among them is the 2011 demonstra- 
tion that people who fully suppress their 
HIV levels with ARVs rarely spread the 
virus to their sexual partners. But this 
treatment-as-prevention strategy has not 
gained much traction in Tijuana. UCSD 
behavioral health scientist Laramie 
Smith recently pooled data from six stud- 
ies of nearly 200 HIV-infected people in 
Tijuana and found that only about half 
even knew they had the virus. Tijuana 
offers free HIV testing through NGOs 
and government-funded clinics, yet no 
plan is in place to regularly test high-risk 
people at venues where they hang out, 
like gay bars or the red light district. 

Those who do learn they’re infected 
rarely get treatment. In Smith’s study, 
only 11% received related medical care, 
and only 3.66% began taking ARVs. The 
federally sponsored HIV/AIDS clinic, 
CAPASITS, provides free ARVs, but it 
is located far from the downtown area 
and is difficult for many people to reach. 
Tijuana is also a hub for migrants, 
including many deportees from the 
United States, and some do not have 
the documents required to receive help 
at CAPASITS. And the services there 
fall short of those in developed coun- 
tries: CAPASITS, for example, must ship 
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patient blood samples to Mexico City for 
measurements of CD4 l 5 miphocyte counts 
and HIV levels. 

Jose Luis Burgos, a Tijuana-based 
clinician who works with the UCSD team, 
says a key problem is that the patient 
load in Tijuana outstrips the availability 
of qualified HIV/AIDS doctors. Burgos 
contends that Tijuana could train primary 
care physicians to diagnose and treat HIV/ 
AIDS patients. “You need to demystify 
HIV care,” he says. CAPASITS, he notes, 
has only three doctors and treats some 
1000 patients. “What kind of care can you 
expect from three providers?” 

Paradoxically, Mexico’s rising economic 
status is hampering the fight. A 2011 grant 
from the Global Fund to Fight AIDS, 
Tuberculosis and Malaria enabled two 
Tijuana NGOs to launch mobile needle- 
exchange units. But the grant ended in 
2013 when Mexico achieved upper middle 
income status and became ineligible for 
Global Fund support. Tijuana’s needle- 
exchange programs shriveled overnight. 
“Mexico is supposed to be upper middle 
income, but the border isn’t,” Burgos says. 

The UCSD team soon documented a 



40% increase in needle-sharing among 
a group of users that it has closely fol- 
lowed. That undermines other efforts to 
reach people with HIV, says UCSD epide- 
miologist Steffanie Strathdee, who leads 
the binational research program with her 
psychologist husband Thomas Patterson. 
“The sad thing” is when drug users come 
in for needle exchange, she says, “they 
have an opportunity to get HIV testing or 
a referral to a drug treatment program.” 
Strathdee hopes the group’s extensive 
research will draw attention to the prob- 
lems and the opportunities in Tijuana. 
“It’s entirely possible to end the AIDS 
epidemic in Tijuana,” she says. And if 
Tijuana can do it, so can much of the rest 
of the world. ■ 



Reporting for this story was supported in 
part by a grant from the Pulitzer Center 
on Crisis Reporting. 



by 90-90-90. The city’s ending AIDS targets 
also include reducing new HIV diagnoses 
from 371 in 2013 to 37 in 2020 and HlV-re- 
lated deaths from an estimated 91 to eight. 

“The biggest challenges are reaching 
into marginalized populations that are not 
getting the services they need,” says the 
health department’s Buchbinder— transgen- 
der people, drug users, African-Americans, 
and the young. Getting to Zero is massively 
expanding the use of “navigators” assigned 
to infected people to help address barriers 
that interfere with care, such as substance 
abuse, food insecurity, homelessness, and 
violence. A new program will better coordi- 
nate medical records at different providers 
to help identify patients who are slipping 
through the net. And HIV-infected patients 
deemed at high risk of not taking ARVs will 
receive extra check-in phone calls and re- 
minders for appointments. 

Rigorous evaluation is critical for the cam- 
paign to succeed, said Havlir at a coalition 
meeting in June. Getting to Zero is raising 
money specifically to track the program’s 
performance, using novel metrics like as- 
sessing the impact of PrEP on new infection 
rates. “This is not just talking, talking, talk- 
ing,” Havlir said. “This is about action.” 

THE DRIVE TO END AIDS is spreading 
worldwide, and there is even something of 
a good-natured race to be first. Washington, 
D.C., New South Wales in Australia, and Bra- 
zil are now in the running as well, and San 
Francisco has attracted intrigued delegations 
from Amsterdam, France, and the White 
House’s Office of National AIDS Policy. 

From his office at the London School of 
Hygiene & Tropical Medicine, which he di- 
rects, epidemiologist Peter Piot is watching 
these efforts with interest— and some skep- 
ticism. “It’s very important that these proj- 
ects proceed and that we learn from them,” 
says Piot, who chaired the “Defeating AIDS” 
commission and formerly headed UNAIDS. 
But he cautions that the intensive efforts 
in BC, New York state, and San Francisco 
must continue indefinitely. “These three 
examples are not North Korean types of 
islands— there will be constant reintroduc- 
tion of the virus,” he notes. 

Ending the global spread of HIV will ul- 
timately take a vaccine, Piot says, stressing 
that treatment as prevention packs a lim- 
ited punch. He points to a mathematical 
model in “Defeating AIDS” that found that 
even if the world achieves the UNAIDS 90- 
90-90 goal in 2030, hundreds of thousands 
of new HIV infections and deaths will still 
occur each year. “We’ll have to see whether 
these three places can end AIDS as a public 
health threat,” Piot says. “But that doesn’t 
mean one shouldn’t try.” ■ 
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be upper middle income, 
but the border isn’t.” 

Jose Luis Burgos, 
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PERSPECTIVES 

CHEMISTRY 

Ji'ackit^ the merry 
dance (tf nanoparticles 

Electron microscopy provides atomic-resolution 
structures of nanoparticles in solution 

By Christian Colliex 




A triumph of electron microscopy. Park et al. use a specially equipped electron microscope to determine the 
atomic structure of individual platinum nanoparticles in solution. 



H anoparticles containing a few tens 
or hundreds of atoms have applica- 
tions in an ever-expanding range of 
areas, from electronics to catalysis 
and biological sensors. This versatil- 
ity stems from the high sensitivity 
of nanoparticle properties to size, chemical 
composition, and shape, which are largely 
determined by the synthesis route by which 
they are produced. Nanoparticles grown in 
solution, termed “colloidal nanoparticles,” 
are of particular interest because they may 
be used in many applications, including as 
biosensors (i). On page 290 of this issue. 
Park et al. (2) report high-resolution struc- 
tures of nanoparticles in solution deduced 
from electron microscopy This spectacular 
result relies on the convergence of recently 
developed techniques from both the physi- 
cal and life sciences and heralds a new era of 
high-resolution studies in solution. 

Colloidal particles made of noble metals, 
particularly gold or platinum, have been 
widely studied for applications in catalysis 
and photocatalysis. As early as 20 years ago, 
Ahmadi et al. showed that the shape of col- 
loidal Pt nanoparticles could be controlled by 
changing the composition and concentration 
of capping materials and metal cations in the 
solution (5). To image the particles, they de- 
posited nanoparticles on an amorphous car- 
bon layer. By tilting the resulting specimen, 
they found that nanoparticles with square 
shapes were cubic and nanoparticles with 
triangular shapes were tetrahedral. However, 
they could not resolve the three-dimensional 
(3D) atomic structure or study the particles 
directly in solution. This is what Park et al. 
now achieve for individual nanocrystals. The 
work follows similar progress realized a few 
years earlier in a gaseous environment (4). 

The success of the study by Park et al. re- 
lies on three technological developments. 

The graphene liquid cell (GLC), introduced 
previously by the same group (5), maintains 
nanometer-scale objects in a liquid environ- 
ment between two layers of free-standing 
graphene. Highly sensitive direct electron 
detectors enable the production of movies 
with about 50 frames per second. To analyze 
their data, the authors adapted the theory I 
and associated software tools created by mo- | 
lecular biologists to reconstruct the struc- ^ 
tures of single biomolecules. | 

In the approach used by Park et al., mov- | 
ies follow individual Pt nanocrystals freely ^ 
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rotating in solution (see the figure, panel A). 
These nanoparticles may thus be observed 
in many different orientations, providing 
the required number of different projec- 
tions for the reconstruction algorithm. In 
molecular biology (see the figure, panel B), 
multiple copies of biomolecules with identi- 
cal structures are flash-frozen in a vitrified 
thin layer of ice (6). One micrograph covers 
a field with hundreds of particles viewed at 
different orientations. These images of in- 
dividual molecules 5 deld a collection of 2D 
projections that are then used to generate a 
single 3D structure. 

A Nanoparticle 

Electrons 







Freely rotating nanoparticle in GLC 



Yuk et al. first used a GLC to image with 
atomic resolution a few critical steps (coales- 
cence, reshaping, and faceting) in the growth 
of Pt nanocrystals (5). Since then, GLCs have 
been used to follow the real-time trajectories 
of Au nanocrystals linked to double-stranded 
DNA fragments. The observations confirmed 
that sample-substrate interactions between 
the liquids and the inert graphene layers are 
minimized (7). The authors then deduced 
the 3D motion from a series of 2D images re- 
corded while the complex nanostructure was 
rotating. This was clearly a key step on the 
way to the present demonstration. Recently, 
Wang et al. used GLCs to encapsulate tiny 
water volumes containing ferritin molecules, 
which consist of a core of iron (hydro)-oxide 
and a protein shell (8). They then used elec- 
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tron energy-loss spectroscopy (EELS) to map 
the spatial distribution of Fe, O, and N ele- 
ments and also reported an analysis of the 
iron valence state in ferritin and in solution. 
The results show that GLCs provide a poten- 
tially useful route for monitoring biochemi- 
cal reactions in situ. 

As for the latest generation of direct 
electron-detection camera technology, it 
has contributed to the rapid improvement 
of high-resolution structures reconstructed 
from cryo-electron microscopy (cryo-EM) 
data. Their signal-to-noise ratio is improved 
in the movie frame mode, allowing refined 

B Biomolecule 

Electrons 



compensation of sample movements and 
reducing the number of particles neces- 
sary for reconstructing the structure of a 
biomolecule (9). Very recently, Bartesaghi 
et al. have demonstrated the structure of a 
ligand-protein complex at 0.22-nm resolu- 
tion {10), quite close to that demonstrated 
by Park et al. 

The algorithms used to reconstruct 3D 
structures from extensive sets of 2D projec- 
tions have been developed over the past two 
decades, mostly for applications in biology. 
One basic assumption is that the population 
of imaged particles is uniform. However, this 
assumption is generally not fully satisfied be- 
cause biomolecules are by nature not rigid. 
Additional computing techniques are thus 
used to sort the images into different classes 
corresponding to different conformations. 
This issue is not as relevant in Park et aZ.’s ap- 
proach, where the same particle is viewed in 
different orientations. Furthermore, metallic 



nanocrystals can generally be considered as 
rigid, as long as the electron beam does not 
cause any transformations. 

The reconstruction of 3D atomic struc- 
tures of individual metallic nanoparticles in 
a vacuum or embedded in a solid matrix has 
benefited from advances in electron micros- 
copy techniques, such as aberration correc- 
tion for higher-resolution images, annular 
dark-field scanning transmission mode for 
counting the number of atoms in each col- 
umn, and the development of dedicated algo- 
rithms for tomography. Successful examples 
include the 3D atomic structures of a gold 
nanoparticle in vacuum {11, 12), 
a silver nanocrystal epitaxially 
embedded in an aluminum ma- 
trix {13), and a gold nanoparti- 
cle of 68 atoms surrounded by a 
thiol molecular cage that makes 
it water soluble {14). Park et al. 
now investigate two distinct 
individual nanoparticles in 
the liquid phase. Their density 
maps reveal that each particle 
consists of three nanocrystals 
joined together nons 5 mimetri- 
cally. The authors anal 3 ^e the 
local organization of the inter- 
faces between these nanocrys- 
tals and compare the results 
with molecular dynamics calcu- 
lations of the free energies for 
grain boundary formation. 

As Park et al. show, the 
structure and d 3 mamics of 
individual members of an in- 
homogeneous ensemble of 
nano-objects in a fluid can 
now be determined with spe- 
cifically equipped modern elec- 
tron microscopes. This advance 
provides a tool for exploring the growth, 
self-organization and reactivity of nanopar- 
ticles in liquids containing different types 
of ions, salts, or molecules. ■ 
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Similar biomolecules randomly oriented in ice 




Projected 2D image Reconstructed 

3D object 



Three-dimensional structures from electron microscopy. (A) Park etai. take many images of an individual platinum nanoparticle as 
it rotates freely in the solution contained in a GLC. They then combine these images of the same particle to determine its 3D structure. 
The method was adapted from cryo-EM (B), where 2D proejctions of numerous copies of the same biomolecule frozen in a vitrified ice 
layer are used to determine the 3D structure of the biomolecule {15). 
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PHYSICS 

Mimetic bubbles with a twist 

Individual skyrmionic bubbles can be generated and moved at room temperature 




By Kirsten von Bergmann 

P resent-day hard disk 
drives use magnetic bits 
that are read out by mov- 
able read heads. To cir- 
cumvent the inherent 
fragility of such a me- 
chanical construction, and to 
exploit the third dimension for 
increased storage density, Par- 
kin et al. proposed a racetrack 
memory device (i). The general 
concept is that the information 
is encoded in a localized magne- 
tization configuration that can 
be driven through the material 
with electrical currents and transported to a 
stationary read head. Recently, interface-in- 
duced skyrmions, which are circular particle- 
like magnetic objects (see the figure, panel 
A), have been envisioned as ideal candidates 
for future racetrack memory-type applica- 
tions (see the figure, panel B) {2, 3). On page 
283 of this issue, Jiang et al. (4) have made 
progress toward realizing such a device ar- 
chitecture. They report on the generation 
and movement of individual skyrmionic bub- 
bles at room temperature, accomplished by 
exploiting two different spin-orbit coupling- 
related effects. 
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Magnetic skyrmions are localized non- 
collinear states with a twist, possessing the 
key characteristic that they are topologically 
protected. That is, they effectively constitute 
a knot in the magnetic texture that adds to 
their stability with respect to a trivial fer- 
romagnetic state. The application of spin 
transfer torques by the injection of lateral 
or vertical electrical currents can induce a 
motion of magnetic skyrmions at surpris- 
ingly low current densities (5, 6). For such 
topologically distinct states, the direction 
of motion depends on the direction of the 
twist, manifested in the rotational sense of 
the spin canting. 

Twisted magnetic states are useful for 
applications only if the direction of the 
twist is unique, thus enabling control over 



the movement via the direc- 
tion of the injected current. 
To achieve this, the twist must 
be intrinsic to the system. A 
magnetic interaction that fa- 
vors one rotational sense over 
the other is the Dzyaloshin- 
skii-Moriya (DM) interaction, 
which arises as a result of spin- 
orbit interaction and can occur 
in systems with broken inver- 
sion symmetry as, for example, 
in chiral crystal structures 
(7, 8) or at interfaces (9-II). 
For the interface-induced sky- 
rmions, the rotational sense 
is determined by the specific 
material, and when realized on a large scale 
by standard epitaxial growth processes, 
it promises an easy implementation into 
present state-of-the-art multilayer-based 
device architectures. 

Until now the only experimental observa- 
tions of interface-induced skjn-mions were 
in perfectly ordered, atomically thin lay- 
ers, which served as model systems (10, 11). 
Tuning of the sk 5 Tmion size from 1 to 5 nm 
was demonstrated. By using local currents 
from a scanning tunneling microscope tip, 
isolated skyrmions were written and de- 
leted individually (11). Future applications 
will rely on other, more practical materials 
in which the DM interaction governs the 
magnetic texture, inducing a defined twist 
of the magnetic state. 




Skyrmions for future storage devices. (A) Sketch of a magnetic skyrmion, a knot in the magnetization with a defined twist. (B) Coherent movement of skyrmions along a 
track with lateral currents inspired by (3). (C) Generation of individual skyrmions by inhomogeneous spin-orbit torques (small blue arrows) behind a geometrical constriction, 
as demonstrated by Jiang et al. (4). (D) Annihilation of skyrmions by small currents that drive the magnetic object out of the magnetic material. (E) Stationary read head; the 
information is encoded in the magnetic texture, which is moved to the readout by small lateral currents along the track. 
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Is biodiversity good for 
your health? 

Disease incidence is often lower in more diverse 
communities of plants and animals 



Jiang et al. report on the observation of 
interface-induced skyrmionic bubbles at 
room temperature in a magnetic layer grown 
epitaxially on a material with large spin-orbit 
coupling. The role of the substrate is twofold: 
First, it provides the interface at which the 
DM interaction is induced; and second, by 
driving a lateral current through the sub- 
strate, the charge is converted to a vertical 
spin current due to the spin Hall effect (12). 
The resulting spin-orbit torques act on the 
magnetic layer and induce a movement of 
the micrometer-sized skyrmions. All of them 
move in the same direction, which means 
they all have the same twist, imposed by the 
DM interaction. 

The generation of individual skyrmions 
is realized by inhomogeneous spin-orbit 
torques occurring behind a geometric con- 
striction (see the figure, panel C). Jiang et al. 
propose an intriguing model of how the bub- 
bles are pinched off from a magnetic stripe 
domain driven through the constriction, and 
draw an analogy to the formation of soap 
bubbles. Thus, for the writing of a skyrmion 
into a track, a perpendicular geometry could 
be used, and with a lateral movement in the 
same direction, it should be possible to delete 
single magnetic objects by moving them out 
of the track (see the figure, panel D), unifying 
all relevant operations in one device. 

With the goal of achieving a skyrmion 
racetrack in sight, there are several issues 
that still need to be resolved. The precision 
of the generation and the degree of control 
over each magnetic skyrmion need improve- 
ment. Jiang et al. demonstrate the generation 
of many skyrmionic bubbles at a time, and in 
their sample the movement is strongly influ- 
enced by local pinning due to the inhomoge- 
neity of the film. Furthermore, the size of the 
skyrmions needs to be reduced to be compet- 
itive with present storage densities. Future 
research will show whether nanometer-sized 
skyrmions are possible in materials relevant 
for applications. However, such a small size 
of a skyrmion will not only limit the meth- 
ods to investigate them but also challenge 
the mechanism and design of an appropriate 
readout device (see the figure, panel E). ■ 
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By Felicia Keesing and Richard S. Ostfeld 

D n a floodplain of the River Saale near 
Jena, Germany, grassland plants are 
naturally bombarded by spores of 
pathogenic fungi. But whether or not 
those fungi cause infection turns out 
to be largely about the neighborhood: 
Plants on highly diverse experimental plots 
have much lower levels of infection than 
plants grown in monoculture (i) (see the 
photos). The pathogens, it appears, are less 
likely to encounter their optimal host on the 
more diverse plots, which reduces disease 
prevalence and incidence. This protective 
effect of diversity has been found in many 
studies, not just for plants but also for dis- 
eases afflicting humans and wildlife. It has 



remained unclear, however, whether this ob- 
servation holds generally {2, 3). In a recent 
paper, Civitello et al. addressed this question 
in a rigorous meta-analysis of diversity-dis- 
ease relationships (4). 

Civitello et al. examined 202 results from 
studies of 61 parasites infecting both plants 
and animals. They found a clear and consis- 
tent reduction in disease prevalence when 
diversity was high. The effect was as strong 
for parasites that infect humans as for those 
that infect only wildlife and was seen both 
for microparasites (pathogens such as bac- 
teria and viruses) and for macroparasites 
(such as trematodes and nematodes). The 
protective effect of diversity for human dis- 
eases is particularly important because it re- 
futes the results of a previous meta-analysis 




The benefits of diversity. Civitello etal. have performed a meta-analysis of studies of 61 parasites infecting both 
animals and plants. The results show that disease prevalence is often higher in less diverse systems (top, barley 
monoculture) than in more diverse systems (bottom). 
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Effects of plant diversity on fungal pathogens in Jena, Germany. Plots with higher host diversity generally had a higher diversity of pathogens (A) but also lower levels of 
overall infection (B). Drawn from data provided in (J). Note that thex-axis is on a log scale. 



of only six parasites, which found no consis- 
tent effect of diversity (5). Civitello et at. also 
found that the effect of diversity was equally 
strong for both experimental and compara- 
tive studies. This is a key result, because al- 
though it has been known for some time that 
diversity can reduce disease in carefully con- 
trolled and constructed experimental com- 
munities, it has been less clear whether the 
effect translates into natural systems. 

The widespread negative effect of high di- 
versity on disease transmission documented 
by Civitello et at. suggests a consistent un- 
derlying mechanism that applies to diseases 
of humans, wildlife, and plants. One part of 
that mechanism has been hinted at in pre- 
vious work: The best hosts for multihost 
pathogens are often abundant, widespread, 
and resilient species {6-8). A recent study 
by Han et al. {9) provides strong evidence 
for this pattern. The authors set out to de- 
termine what characteristics make some 
species good reservoir hosts of diseases that 
infect humans. Working with a database of 
2277 rodent species and 66 pathogens, they 
used machine-learning algorithms to explore 
what aspects of host life history, physiology, 
behavior, ecology, and biogeography are asso- 
ciated with pathogens that spill over into hu- 
man populations. Several patterns emerged. 
Good reservoir hosts have broad geographic 
ranges that contain comparatively few other 
species. They also reach maturity at a young 
age and have a short gestation period and a 
large litter size, all characteristics of what is 
often called a fast life history. 

Why might species with fast life histories 
be good hosts for pathogens? Two hypoth- 
eses have garnered some support. First, 
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parasites and pathogens that infect multiple 
species might evolve to exploit those hosts 
that they are most likely to encounter in na- 
ture— in other words, the most widespread, 
abundant hosts. Species that are widespread 
and abundant often have fast life histories 
(10). The second possibility is that fast-living 
hosts tend to be less resistant to, or more 
tolerant of, infection because of how they al- 
locate their immune defenses (11). 

Whatever the causes of these patterns, 
the consequences are intriguing. Species 
with fast life histories are often resilient to 
disturbance and are thus likely to persist 
in the face of environmental stressors that 
cause other species to decline or disappear 
{12). Together, these two patterns— the resil- 
ience and ubiquity of species with fast life 
histories and their odds of being high-qual- 
ity hosts for parasites and pathogens— may 
interact to produce the diversity-disease re- 
lationship. As biodiversity is lost from eco- 
logical systems, the species most likely to 
persist may tend to be those most likely to 
harbor and transmit pathogens at high rates. 

But won’t ecological communities with a 
higher diversity of hosts also have a higher 
diversity of parasites and pathogens? And 
might a higher diversity of parasites and 
pathogens counteract the disease-suppress- 
ing effects of high host diversity? These 
questions have been at the heart of critiques 
of the diversity-disease relationship {3, 13). 
An answer may be emerging. Back on the 
German floodplain, Rottstock et al. {!) found 
that the diversity of fungal pathogens was 
indeed higher in the plots with higher plant 
diversity, but the level of infection was lower 
(see the chart). Even the prevalence of co- 
infection— the simultaneous infection of 
a host with more than one pathogen— was 
lower where host and pathogen diversity 
were high. If this pattern of reduced disease 



with increased parasite diversity occurs in 
other systems, it would help to resolve what 
some have seen as a paradoxical relationship 
between diversity and disease. 

Many fascinating questions remain about 
the relationship between diversity and dis- 
ease, but Civitello et alls meta-analysis dem- 
onstrates that diversity frequently reduces 
disease. At the same time, Han et al. {9) have 
identified traits in rodents that strengthen 
the connection between life history and the 
probability that a species will be a reser- 
voir for human pathogens. Case studies to 
evaluate diversity effects in specific disease 
systems will remain important, as will the 
pursuit of mechanisms that underlie the 
general pattern. Perhaps the most impor- 
tant question that remains is about the ap- 
plication of this knowledge to public policy. 
Should health protection be added to the 
long list of ecosystem services provided by 
biodiversity (2)? The meta-analysis by Civi- 
tello et al. suggests that it should. ■ 
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Neutrophil-macrophage communication in 
inflammation and atherosclerosis 

Neutrophils may license macrophages to respond to cholesterol crystals and drive 
inflammation that aggravates atherosclerosis 



By Matthias Nahrendorf 
and Filip K. Swirski 

N eutrophils and monocyte-derived 
macrophages are myeloid cells, with 
a shared hematopoietic ancestry, that 
pursue both common and distinct 
immune functions. In response to 
tissue damage or infection, neutro- 
phils infiltrate first, followed by monocytes 
(i). Through a series of events involving the 
release of proinflammatory products, both 
cell t 5 ^es seek to neutralize danger, and 
if successful, inflammation resolves. But 
when stimuli persist and inflammation is 
not resolved, neutrophils and monocytes 
continue to accumulate, presumably in- 
flicting the irreversible damage that char- 
acterizes chronic inflammation. Whether 
neutrophils and monocytes act together to 
fuel the inflammatory cascade in those cir- 
cumstances is still poorly understood. On 
page 316 of this issue, Warnatsch et al. (2) 
report that neutrophils prime macrophages 
for inflammatory responses that aggravate 
atherosclerosis. Neutrophil extracellular 
traps (NETs) underlie the communication 
between these two immune cell types. 

A distinguishing feature of neutrophils is 
their ability to expel cytosolic and nuclear 
material, a suicidal act (NETosis) that en- 
snares extracellular microbes. NETs are 
web-like structures containing DNA, his- 
tones, neutrophil elastase, and myeloperox- 
idase (3) (see the second figure). According 
to Warnatsch et al., NETs can be harmful in 
the setting of inflammatory atherosclerosis. 
In this case, extracellular cholesterol crys- 
tals interact with neutrophils to trigger the 
release of NETs, which prime macrophages 
to produce a precursor form of the inflam- 
matory cytokine interleukin-lp (pro-IL-lp). 
In addition to activating NETosis, choles- 
y terol crystals bind to the cell surface pro- 
I tein CD36 on macrophages, are internalized 
I via endocytosis (4), and activate a signaling 
I complex called the inflammasome (5). The 

5 inflammasome promotes maturation of 
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IL-lp via cleavage of pro-IL-lp with cas- 
pase-1. Thus armed, macrophages present 
in atherosclerotic plaques carry out inflam- 
matory and plaque-destabilizing functions. 
Ultimately, these processes induce throm- 
botic complications that interrupt arterial 
blood supply to downstream tissues, caus- 
ing myocardial infarction and stroke (6). 

Warnatsch et al. identified NETs accru- 
ing adjacent to cholesterol crystals and 
macrophages in atherosclerotic lesions of 
apolipoprotein E-deficient {Apoe~'~) mice. 
In this model of atherosclerosis, the ab- 
sence of ApoE impairs reverse cholesterol 
transport, resulting in hypercholesterol- 
emia, leukocytosis, and plaque formation 
in the vessel wall. Incubation of human 
neutrophils with cholesterol crystals in 
vitro triggered NETosis— that is, neutro- 
phils died while expelling nuclear material, 
which includes the well-known danger sig- 
nal DNA. To determine whether NETosis 
contributes to atherogenesis, the authors 
engineered Apoe~i~ mice that also lack neu- 
trophil elastase and proteinase-3 {Apoe~^~l 



Ela2~i~/Prtn3~'~), two serine proteases that 
are secreted during inflammation to de- 
stroy pathogens and host tissue, and local- 
ize to NETs. These triple mutant mice could 
not support NETosis. When consuming a 
diet high in fat and cholesterol, Apoe ' l 
Ela2~'~/Prtn3~i~ mice developed dramati- 
cally smaller atherosclerotic lesions com- 
pared to Apoe~i~ control animals, despite 
similar lipid concentrations and leukocyte 
counts in blood. In the triple mutant mice, 
the authors found no NETs, lower systemic 
IL-lp concentration, and fewer lesional 
T cells that produce IL-17, a cytokine that 
propagates inflammation by promoting the 
expression of multiple inflammatory cyto- 
kines (although its role in atherosclerosis 
is not firmly established). Moreover, DNAse 
treatment of Apoe~'~ mice resulted in the 
same small lesion phenotype as seen in the 
Apoe~'~/Ela2~i~/Prtn3~i~ mice, further sup- 
porting DNAs atherosclerosis-aggravating 
role. Warnatsch et al. also noted that incu- 
bation of human monocytes first with NETs 
and then with cholesterol crystals resulted 




Fueling inflammation. Cholesterol crystals and NETosis collaborate to activate the inflammasome in 
murine atherosclerosis. 



SCIENCE sciencemag.org 



17 JULY 2015 • VOL 349 ISSUE 6245 237 



Published by AAAS 



INSIGHTS I PERSPECTIVES 



GLOBAL NUTRITION 

Metrics for 

land-scarce 

agriculture 

Nutrient content must 
be better integrated into 
planning 



in strong inflammasome activation. The 
observations suggest a two-step inflamma- 
some activation process (see the first figure) 
in murine plaque macrophages in which 
NETS prime a macrophage for inflamma- 
some activation and pro-IL-lp production. 
Cholesterol crystals then induce lysosomal 
damage, which is sensed by the macro- 
phage inflammasome called NOD-like re- 
ceptor family, pyrin domain containing 3 



(NLRP3). NLRP3 activates caspase-1, which 
subsequently leads to proteolytic cleavage 
and release of IL-lp (5). 

Cholesterol crystal-induced NETosis 
as a macrophage inflammasome stimu- 
lator adds to the growing knowledge on 
neutrophil activity in atherosclerosis, 
including the role of neutrophils in aug- 
menting monoc 5 de recruitment (7) and 
the pro-thrombotic actions of NETs in the 
setting of plaque erosion (8). Do these in- 
teractions occur in human atherosclerotic 
plaque? Although Warnatsch et al. provide 
compelling human in vitro data, and NETs 
are present on the surface of eroding hu- 
man plaque (8), it will be important to 
verify that the observed murine phenotype 
translates to patients. Additionally, it is 
unknown if similar interactions between 
neutrophils and macrophages also occur in 
other organs, as these cells travel far and 
hypercholesterolemia is systemic. Hence, 
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it will be imperative to determine whether 
the phenotype observed in Apoe~i~IEla2~i~/ 
Prtn3~^~ mice is exclusive to NETosis; other 
leukocjdes, including macrophages, also 
express proteinase-3 (P). The high turn- 
over of plaque macrophages (JO) and the 
observation of NETosis in diabetes (Jl), a 
common risk factor for atherosclerosis, 
mean the striking phenot}q)e reported 
by Warnatsch et al. could depend on ad- 
ditional mechanisms that still 
await discovery. 

Warnatsch et al. introduce 
an interesting new facet to our 
understanding of how neutro- 
phils and macrophages commu- 
nicate. Macrophage activation 
through NETosis contrasts with 
neutrophil-macrophage interac- 
tion during resolution of inflam- 
mation. When macrophages 
engulf apoptotic neutrophils, a 
noninflammatory macrophage 
phenotype supports healing 
and a return to a steady state 
(i). Thus, the presence of cho- 
lesterol crystals dramatically 
influences cell-cell interactions 
and macrophage function. In 
considering possible therapies, 
inhibiting NETosis, or at least 
eliminating extracellular DNA, 
may prove beneflcial. An alter- 
native approach could reduce 
neutrophils by limiting their 
production in the bone marrow, 
migration, and/or recruitment. 
Both strategies would edit the 
instructions that macrophages 
receive from their tissue micro- 
environment, possibly curbing 
inflammasome activation and 
promoting inflammation resolution. Further 
downstream in the pathway, inhibiting the 
active form of IL-lp with a neutralizing anti- 
body is a promising strategy currently being 
tested in patients with atherosclerosis {12). ■ 
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By Ruth DeFries,^^ Jessica Fanzo,^ 

Roseline Remans, Cheryl Palm,® 

Stephen Wood,^® Tal L. Anderman® 

O ver the past half-century, the para- 
digm for agricultural development 
has been to maximize yields through 
intensifying production, particularly 
for cereal crops (i). Increasing pro- 
duction of high-yielding cereals— 
wheat, rice, and maize— has replaced more 
nutrient-rich cereals, which has eroded the 
content of essential dietary nutrients in 
the world’s cereal supply. New approaches 
are needed to produce healthy 
POLICY foods, rich in essential nutri- 
ents, with efficient use of land. 
Standard yield metrics that measure the 
quantity of production are inadequate to as- 
sess progress toward this goal; thus, we pro- 
pose alternative metrics of nutritional yields. 

Intensiflcation of agriculture through 
multiple crops per year, high-yielding seed 
varieties, S5mthetic fertilizer, mechaniza- 
tion, and other inputs increases 5 uelds and 
improves the efficiency of using land to pro- 
duce food. In the last 50 years, intensifica- 
tion of cereal production has increased the 
world’s cereal supply by a factor of almost 
2.2, outpacing the 1.3-fold increase in popu- 
lation growth {2). Intensiflcation was critical 
for averting food shortages that loomed on 
the horizon in the 1960s. 

Intensiflcation has spared 18 to 27 mil- 
lion hectares that would have been required 
to produce the same amount of cereals 
with yields equivalent to those in the mid- 
1960s (i). But intensiflcation can exacerbate 
land-clearing in the absence of appropriate 
policies and enforcement (8). Moreover, in- ^ 
tensiflcation relies on high inputs of energy, g 
fertilizer, pesticides, and water {4). Environ- | 
mental consequences include runoff of excess g 
fertilizer that damages water quality, toxicity | 
from pesticides, nitrous oxide emissions, and § 
degradation of habitat for biodiversity. % 

With increasing competition for land g 
(e.g., for carbon storage, watershed protec- i 
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Deploying NETs. Colored scanning electron micrograph of neutrophil 
extracellular traps (NETs, brown) capturing spores from the yeast 
Candida albicans (yellow). 
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the proportion of maize diverted for feed 
remained fairly stable over the time period, 
the change in the mix of cereal types had an 
even larger effect on reducing the nutritional 
content of the global, directly consumed ce- 
real supply than the diversion to feed (table 
SI and fig. S4). 

In other words, the amount of cereals that 
a person would need to consume to fulfill the 
daily dietary reference intake (DRI) has in- 
creased for protein, iron, and zinc, based on a 
mix of cereals in proportion to the production 
of each type. In 1961, 533, 821, and 735 g of ce- 
reals were needed to satisfy requirements for 
protein, iron, and zinc, respectively. By 2011, 
the amount required increased to 556, 1013, 



and 777 g. Grams required to satisfy energy 
requirements remained nearly unchanged at 
625 to 623 g over this time period. The nutri- 
ent-to-calorie ratio in our directly consumed 
cereal supply has declined, with less nutrient- 
dense cereals contributing to high levels of 
micronutrient deficiencies, particularly in 
low-income settings with cereal-based diets. 
Although our analysis did not address bio- 
availability our estimates of amounts needed 
to meet requirements would be expected 
to increase because of the relatively low 
bioavailability of micronutrients in grains, 
which depends strongly on processing and 
cooking procedures. 

Declining nutritional content of cere- 
als could be compensated by trade and in- 
creased consumption of other foods {IS). 
Diversion of cereals for livestock increases 
consumption of animal products and can 
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compensate for eroding nutritional content 
of cereals but requires more land because of 
low conversion efficiencies of animal feed to 
protein (16). Moreover, low-income popu- 
lations have limited access to substantial 
amounts of animal products. 

NEW METRICS. The Standard yield metric 
of agricultural production by weight per unit 
land neglects the importance of human nu- 
tritional needs as a critical factor for sustain- 
able intensification. We propose a metric of 
“nutritional 5 deld,” the number of adults who 
would be able to obtain 100% of their recom- 
mended DRI of different nutrients for 1 year 
from a food item produced annually on one 



hectare [see supplementary materials (SM)]. 
Conversely, the inverse of the metric indi- 
cates the land area required to grow enough 
food to supply one person with 100% of rec- 
ommended daily requirement of different 
nutrients from a food item for 1 year. 

In reality, people do not obtain 100% of 
recommended nutrient intake from a single 
food item, and processing and cooking fur- 
ther affect the nutrients available for human 
consumption (14). However, the metric al- 
lows comparison among different crops and 
production systems to evaluate nutritional 
value produced from a given land area. The 
metric could be compiled over different food 
items and along food value chains to measure 
nutritional 5 deld for a food system as a whole. 

In 2013, for example, on average one hect- 
are of rice produced 4.5 metric tons/year, 
which is the equivalent of providing the 
annual energy requirement for 19.9 adults. 
Millet produced only 0.9 metric tons/ha per 
year, the annual energy requirement for 4.0 
adults. However, a hectare of rice fulfills the 
annual iron requirement for only 7.6 adults, 
compared with 15.3 for millet. Similarly oats 
yield more zinc per ha than all other cereals 
except maize, despite providing fewer metric 



tons and energy per hectare than rice, wheat, 
and maize (see the graph, B). In India and 
LDCs, drought-tolerant millet and sorghum 
provide nutritional yields for iron higher 
than rice, despite relatively low yields in 
terms of metric tons per hectare. In China, 
wheat and rice have the lowest nutritional 
yields of all cereals for iron (fig. S6). 

Decisions about the desired mix of crops 
to achieve efficient use of land differ accord- 
ing to the priority: quantity of production or 
quality to produce essential dietary nutri- 
ents. The nutritional yield metric can guide 
decisions to simultaneously address both 
priorities at multiple scales. At the field and 
farm scales, the metric can quantify benefits 
of improved, nutrition-sensitive agricultural 
practices, such as the use of biofortified va- 
rieties and integrated soil fertility manage- 
ment. At a landscape or national scale, the 
metric can be applied to formulate policies 
that promote a mix of crops that balance 
productivity in terms of quantity and nutri- 
tional needs of the respective populations. At 
the global scale, the metric could be included 
in projections of food demand and require- 
ments for micronutrients and other macro- 
nutrients beyond energy. 

With growing pressures on land resources, 
food systems will be called upon to use land 
efficiently. At the same time, scarce land re- 
sources need to provide adequate nutrition 
for the world’s population and alleviate mi- 
cronutrient deficiencies. This confluence of 
imperatives calls for new alliances, metrics, 
and analyses for incorporating human nutri- 
tion as a primary consideration for sustain- 
able agriculture. ■ 
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SUPRAMOLECULAR CHEMISTRY 



Living supramolecular polymerization 

Greater control is achieved over the chain growth and properties of dynamic materials 



By Rahul Dev MuMiopadhyay^’^ 
and Ayyappanpillai Ajayaghosh^’^ 

H ost pol3oners that we encounter, like 
those used in grocery bags and soda 
bottles, are formed from small mol- 
ecules (monomers) held together by 
strong covalent bonds. In supramo- 
lecular polymerization, monomers 
bond through weaker reversible interactions, 
such as hydrogen bonds (H-bonds). Supra- 
molecular polymerization usually proceeds 
through step-growth mechanisms (i), where 
both ends of each monomer are reactive and 
many smaller oligomers form before long 
pol3miers appear. To overcome this prob- 
lem, a method developed for covalent poly- 
mers known as living polymerization has 
been adopted for supramolecular poljmiers 
to achieve better control and uniformity of 
chain growth and dispersity. 

Living polymerization is a type of chain- 
growth polymerization in which monomers 
undergo polymerization only upon reacting 
with an initiator to generate an active center. 
The active site regenerates with each mono- 
mer addition; it propagates along the pol3mier 
strand before transferring the active center 
to another pol3mier strand or terminating via 
mutual coupling. In fact, the last two steps of 
chain transfer or termination are essentially 
removed in a living polymerization process. 
The last monomer unit on a polymeric strand 
remains active until deliberately terminated, 
so adding more monomer— or a different 
monomer— resumes the reaction. Properties 
like the degree of polymerization (number of 
monomers in the chain), the chain confor- 
mation, and its lifetime (of the propagating 
chain) can therefore be efficiently controlled 
if a chain-growth polymerization is realized 
in such dynamic supramolecular systems (2). 

An early attempt on controlled supramo- 
lecular polymerization was made by Manners, 
Winnick, and co-workers (3) by assembling 
polyferrocenyldimethylsilane block copoly- 
mers in hydrocarbon solvents. Addition of a 
fresh feed of the poljmier in a good solvent 
like tetrahydrofuran caused the nanosized 
cylindrical micelle seeds to grow up to mi- 
crometers in length. Very recently, they have 
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Formation of micelles and nanofibers. (A) Laser scanning confocal microscopy image (scale bar, 5 pm) of self- 
assembled polyferrocenyldimethylsilane block copolymer micelles functionalized with fluorescence tunable BODIPY 
(boron-dipyrromethene) derivatives. Molecular structures responsible for different emission colors are represented: 
n-Bu, n-butyl (6). (B) Supramolecular seeded growth of nanofibers from molecule 1 proceeds slowly after first forming 
nanoparticles, but when these nanoparticles interact with existing nanofibers (C), the process is much faster (7). 
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prepared hierarchical [one-dimensional (ID) 
or 3D] multiblock comicelle structures by 
addition of different block copol 5 miers to the 
preformed micellar seeds, which were stable 
both in solution and in solid state (4). “Liv- 
ing crystallization”-driven block copolymer 
self-assembly was also used to prepare func- 
tionalized block copolymers (see the first 
figure, panel A) with segments of different 
emission colors (5, 6 ). 

Later, Sugiyasu, Takeuchi, and co-workers 
showed that systems undergoing self-assem- 
bly following a nucleation-elongation mech- 
anism coupled with a kinetically controlled 
pre-equilibrium process results in living su- 
pramolecular pol 3 miers (7). These authors 
observed that a porphyrin-based molecule 
initially forms that “wrong product,” the for- 



nanofibers of the organogelator as seeds. 

Although the above reports have taken us 
a few steps closer to the concept of living su- 
pramolecular pol 5 mierization, unimolecular 
control on chain growth remained elusive 
until recently Miyajima, Aida, and co-work- 
ers (P) reported an interesting property of 
a nonplanar bowl-shaped corannulene mol- 
ecule appended with five amide-functional- 
ized thio alkyl chains (see the second figure, 
panel A). This Cg-symmetric molecule did 
not undergo self-assembly via intermolecu- 
lar H-bonding in methylcyclohexane (MCH) 
because the formation of intramolecular 
H-bonds between the five amide chains is 
more conformationally feasible. However, 
upon heating, this metastable cagelike 
monomer opened up to undergo a sponta- 



When chiral initiators were used, po- 
l 5 mierization occurred only when the con- 
figuration of the stereogenic center of the 
monomer matched with that of the initiator. 
The monomer (M) with achiral side chains 
consisted of an asymmetric center at the 
corannulene core that formed a racemic 
mixture through a bowl-to-bowl inversion 
process. The chiral initiators (I) could dif- 
ferentiate between the enantiomers of M 
and undergo polymerization with selective 
handedness. This property was used to op- 
tically resolve a racemic mixture of mono- 
mers and by using either or as 
an initiator (see the second figure, panel B). 

Supramolecular living pol 3 mierization 
is a step closer to the precision synthesis 
of complex architectures and presents us 
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Unimolecular control of chain 
growth. (A) Unimolecular control 
in a supramolecular living chain- 
growth polymerization with initiator 
I and monomer M. (B). Schematic 
representation of optical resolution 
process is shown. The different M and I 
derivatives are represented below; Me, 
methyl (10). [Figures reproduced with 
permission from (6), (7), and (10)] 



mation of so-called “J-aggregates” that lead to 
metastable organic nanoparticles. After sev- 
eral days, these nanoparticles transformed 
into stable “H-aggregates” with nanofiber- 
like morphology via a cooperative mecha- 
nism (see the first figure, panel B). When an 
aliquot of the nanofibers was added to a so- 
lution of the nanoparticles, which serve as a 
reservoir for the porphyrin monomers, nano- 
fibers formed much more rapidly (within a 
few hours) through a living polymerization 
process (see the first figure, panel C). 

Recently, a clever molecular design by 
Wiirthner and co-workers (8) allowed a pro- 
longed lag time in the self-assembly process 
of a perylene bisimide-based organogela- 
tor by locking the molecule in an inactive 
conformation via induced intramolecular 
H-bonding. A living polymerization process 
was achieved by introducing preassembled 



neous ID self-assembly. Once such assembly 
formed, it continued to grow with a fresh 
supply of the metastable monomer even at 
low temperatures { 9 ). 

Further work by Miyajima, Aida, and 
co-workers used corannulene molecules 
functionalized with methyl-substituted 
amides as the initiator of the chain-growth 
polymerization because these molecules re- 
main in an open conformation in the ab- 
sence of intramolecular H-bonds and serve 
as suitable proton acceptors when a caged 
corannulene molecule approaches { 10 ). Po- 
lymerization was initiated only from one 
face of the monomer and promoted growth 
only in a particular direction. The resulting 
polymers were robust enough to remain 
stable even at a 10-fold dilution, so it was 
possible to analyze them by size exclusion 
chromatography. 



with a broad canvass to work toward new- 
generation functional materials. Controlled 
living polymerization in systems like triaryl- 
amines { 11 ) or donor-acceptor molecules re- 
mains challenging with respect to the design 
of organic electronic devices with optimum 
light-conversion efficiency. ■ 
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ARTIFICIAL INTELLIGENCE 

Virtual love 

A young programmer falls 
for a humanoid robot, but is 
the feeling mutual? 

By Rosalind W. Picard 

I n the opening scene of Ex Machina, 
Caleb Smith, a tall, young programmer, 
has just won a contest to spend a week 
at the home of his company’s founder, 
Nathan Bateman. Bateman is the genius 
programmer behind Bluebook, a cor- 
poration handling 94% of the world’s web 
search requests. On arriving at the isolated 
retreat, Caleb learns that he will be the first 
person to see if Nathan’s newest creation, the 
robot Ava, passes the Turing test. The clas- 
sic test, developed by Alan Turing, has the 
computer hidden from the human, who asks 
questions, receives answers via text, and in 
the end must judge, “Is this a human or a 
computer?” When Caleb points out that this 
will be different— that he will know that it’s 
a computer from the start— Nathan argues 
that it will be all the more impressive if Ca- 
leb knows that Ava is a computer and yet be- 
comes convinced that she is conscious. 

When Caleb meets Ava, we see close ups 
of Ava’s face and shots framing her sculpted 
breasts, transparent waistline, and inter- 
nally illuminated organs. Caleb responds as 
any healthy 26-year-old heterosexual male 
might, with a slack jaw. When Nathan asks 
Caleb, “How do you feel about her?” Caleb is 
effusive: “She’s fucking awesome.” The more 
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interesting question, however, is what Nathan 
asks next, “How does she feel about you?” 

As Caleb and Ava supposedly grow closer, 
we watch for the emotions to develop. When 
Caleb shares that his parents died when he 
was 15, Ava’s face remains unmoved; her 
pause is only perfunctory before changing the 
topic. Later, we see experiences that should 
elicit expressions of pain and do not. Her face 
reminds me of the face of a dermatologist I 
once met who had experimented a bit too ea- 
gerly with Botox. Scientists have shown that 
we read a neutral facial expression as happy 
if we felt happy before we saw it or as slightly 
sad if we felt slightly sad before we saw it. We 
may buy that Ava has feelings because she 
produces words that describe feelings and be- 
cause Nathan said “she can feel pleasure,” but 
we do not see her demonstrate convincing 
empathy or emotional intelligence (7). 

At one point, Caleb recognizes that Ava 
has “mind-reading” abilities. He does not 
mean the “I know all your thoughts” kind 
of mind reading that is routinely debunked 
but the ability to infer the likely mental state 
of another person— a simple task for most 
people. For example, if your phone is placed 
under a basket while you watch, and then 
moved while you are out of the room, when 
you come back to retrieve it, a “mind reader” 
would expect you to look for it where it had 
been and for you to be surprised to find that 
it is not where you left it. Ava speaks lines 
that imply that she is aware of minds and 
can reason about them. Some scientists argue 
that a lack of mind reading is the hallmark of 
autism, although there is also ample counter- 
evidence to this. As such, Nathan’s remark 
that autistic people are not aware of their 
own minds, or those of others, is the first line 
I would have put on the cutting room floor. 

When Caleb tries to engage Nathan intel- 
lectually about how Ava works, keywords like 
“stochastic” and “linearized” are sprinkled 
into the conversation in a way that is in- 



tended to sound erudite without boring the 
uninitiated. To those of us who build emo- 
tional artificial intelligence (AI), however, the 
phrases have the sound of a toddler randomly 
sampling keys on a Steinway grand piano. 

Nonetheless, the director, Alex Garland, 
has done some homework on the science of 
bringing emotions, emotional intelligence, 
and more to AI. He is wise, for example, to 
stretch the Turing test over multiple days 
(2). He is also clearly aware of how valuable 
it is to collect massive data to train the sys- 
tem. At one chilling point, Nathan tells Caleb 
how Ava was taught to read and synthesize 
facial expressions by turning on every camera 
and microphone in every cell phone on the 
planet and recording everybody without their 
knowledge. Still, the film leaps over the major 
breakthroughs that would be required before 
we could encounter such a future, namely, 
the complete lack of evidence that machines 
could ever have conscious feelings like ours. 

A particularly glaring gap is the fact that 
Ava is programmed without morality. Some 
might consider morality an option or an 
upgrade; however, smart machines do not 
evolve unguided. A machine’s choices are 
significantly biased by the procedures with 
which it has been programmed. Ava, and her 
actions, say more about the mind of her pro- 
grammer than about the robot he created. 
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PUBLIC HEALTH 

Food for 
thought 

A beginner’s guide 
to ethical eating 



By Nicholas Freudenberg 

S hould I get my morning coffee at 
Dunkin’ Donuts, where it is cheap but 
where it will be served by workers 
who are not paid a living wage (i)? Or 
should I go to Starbucks, where it will 
cost much more but workers get better 
pay and benefits (2)? When I shop for dinner 
tonight, should I buy organic, free range, or 
natural beef— or skip meat altogether to sup- 
port the rights of cows and the well-being of 
our planet? And when Monsanto urges me to 
reject a legislative proposal to require labels 
on genetically modified food, should I accept 
their argument that labeling will make food 
more expensive— especially for the poor? Or 
should I instead listen to my environmen- 
talist friends who assert that we should not 
expose humans and the planet to modified 
organisms whose long-term health and envi- 
ronmental impacts are unknown? 

These are among the questions Paul B. 
Thompson examines in From Field to Fork: 
Food Ethics for Everyone. His goal is not to 
tell us what to eat but to apply the methods 
of philosophy and ethics to the choices in- 
dividuals and societies have to make about 
food. He hopes to help readers find the right 
balance between “enabling choices vs. im- 
posing values on others.” His refreshing be- 
lief is that public dialogue and debate on the 
values and ethics of key food questions will 
enable policy-makers and the public to make 
more informed and equitable choices. 

Thompson is a philosopher at Michigan 
State University, where he holds the W. K. 
Kellogg Chair in Agricultural, Food, and 
Community Ethics. His 30 years’ experi- 
ence at the crossroads of philosophy, food, 
and agriculture make him a knowledgeable 
guide to food ethics, a field he helped to cre- 
ate. This book considers a variety of topics 
that have attracted media and public atten- 
tion: hunger and food insecurity in a land 
of plenty; the impact of the Green Revolu- 
tion and genetically modified crops on food 
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production, famine, and the environment; 
the ethical, health, and environmental ratio- 
nales for vegetarianism; and the human cost 
of cheap food (workers paid so little they 
must depend on public benefits to survive). 

Thompson’s deep knowledge gives him 
some unexpected insights. Although he 
doubts that existing evidence on genetically 
modified organisms (GMOs) warrants re- 
strictions for health or environmental rea- 
sons and accepts the contention that they 
can help make food more available to the 
world’s poor, he argues that their widespread 
use endangers traditional forms of agrarian- 
ism. The ancient Greeks, Thomas Jefferson, 
and current agroecologists have all claimed 
that small farmers are the foundation for 
democracy and equality. By patenting seeds 
and dominating global markets, a few mul- 
tinational companies have wrested decisions 
about how to farm from small growers and 
thus diminished their power to shape rural 
economies and politics. Thompson argues 
that these and other doubts about GMOs 
warrant continued skepticism about ex- 
panding their use. 

Thompson recognizes that how we grow, 
sell, and eat food cannot be separated from 
our economic and political systems. “Since 
everyone needs to eat,” he writes, “con- 
trol of food is a powerful locus for both 
profit-taking and the exercise of social con- 
trol.” However, the book does not directly 



From Field to Fork examines how our food choices 
affect society and the planet. 

consider some of the most urgent ethical 
questions confronting our food system. For 
example, are the business models of PepsiCo 
and McDonald’s ethical when they depend 
on persuading individuals to consume high- 
sugar, -fat, and -calorie products that are 
associated with costly diet-related diseases? 
And is New York City imposing a “nanny 
state” or simply living up to its mandate to 
protect public health when it seeks policies 
designed to restrict portion sizes or add 
warning labels to unhealthy foods? By not 
addressing such questions, Thompson leaves 
two key players in our food system— global 
food companies and government— curiously 
absent from his analyses. Because changing 
the practices of these two food behemoths is 
the goal of the global food justice movement, 
a deeper analysis of the ethical frameworks 
each uses could have helped to advance the 
search for sustainable and moral solutions to 
global food problems. 

Despite this gap. From Field to Fork is a 
sensible and engaging introduction to food 
ethics. Thompson avoids moralizing and 
rejects the tendency of some scientists to 
assume that evidence alone can settle food 
policy fights without engaging with the 

c/3 

moral arguments of critics. By emphasizing S 
that our food system ultimately reflects our J 
values, Thompson sets the table for more | 
satisfying discussions of food policy. | 
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Edited hy Jennifer Sills 

Holocene as 
Anthropocene 

IN THEIR PERSPECTIVE “Defining the 
epoch we live in” (3 April, p. 38), W. F. 
Ruddiman et al. write that in spite of its 
popularity, the Anthropocene still lacks an 
official onset. They propose that the term 
anthropocene be used informally (without 
the initial capital), which would avoid 
the constraints of a formal designation. 
We disagree. 

The human footprint on the planet is 
so distinctive and lasting that the Age 
of Man must be officially recognized 
in the geologic time scale. We support 
considering the Anthropocene and the 
Holocene as a single geologic time span. 
This solution has already been pro- 
posed (i) but was inexplicably ignored. 
Combining the two epochs would provide 
the Anthropocene with a climatic jus- 
tification {2). It was the end of the last 
glacial period that allowed the increase 
of the human population and its role as 
a geologic force, and anthropic impacts 
on the planet took place during most of 
the Holocene. Moreover, the latest studies 
are placing the appearance of domes- 
ticated plants and animals— which can 
plausibly be considered the onset of the 
Anthropocene (3)— increasingly earlier in 
time, closer and closer to the 11.7 ky B.P. 
Pleistocene-Holocene boundary (4). 

Even if the impact of humans was not 
immediately major and uniform across 
the planet, what are a few thousand 
years of discrepancy between the end of 
the last glacial period and the dawn of 
the Anthropocene in comparison to the 
entire geological scale? Most transitions 
between the formally defined epochs 
would be much longer than the Holocene. 
Anthropocene seems a more reasonable 
name than Holocene for this combined 
time span, whose most characteristic 
trait is the human pressure on the planet. 
Holocene could possibly be the first stage 
of the Anthropocene, the one character- 
ized by a soft and spotty human impact 
on Earth. 
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At what point did humans’ impact on Earth become geologically significant? 
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Geological evidence 
for the Anthropocene 

DEFINITIONS ARE ONE of the bedrocks 
of science. However, W. F. Ruddiman 
et al. (“Defining the epoch we live in,” 
Perspectives, 3 April, p. 38) propose that 
the geological term Anthropocene should 
remain deliberately undefined and ambigu- 
ous. They recommend not formalizing the 
term because some inception dates miss 
important earlier human-induced environ- 
mental impacts, particularly widespread 
farming. We have sjmipathy for this view. 
However, defining the Anthropocene 
as a geological epoch must be based on 
evidence, including changes to the Earth 
system lasting millions of years and the 
existence of stratigraphic evidence mark- 
ing such changes (i). All other geological 
time-units have agreed-upon dated markers 
or agreed-upon dates. The Anthropocene 
should not be treated differently. 

Ruddiman et al. discuss a recent paper 
suggesting that the Anthropocene began 
in 1945 (2). Zalasiewicz et al. (2) note, as 
have others (3), that many stratigraphic 
changes are coeval with the mid-to-late 
20th century. However, they then side-step 
geological convention by not specifying a 
Global Stratotype Section and Point (GSSP) 
to define the Anthropocene. Instead, they 
choose the date of the first nuclear weapon 
detonation (2), which created no strati- 
graphic evidence in 1945. Given that GSSPs 
are the preferred method of marking 
geological time units (i), 1945 is unlikely 



to obtain the backing of the multiple com- 
mittees of geologists necessary to formally 
define the Anthropocene epoch (4). 

The appropriate GSSP marker relating 
to nuclear weapons is the globally measur- 
able peak in radionuclide fallout, 
in 1964, after the Partial Test Ban Treaty 
(3). An earlier date also likely adheres 
to the geological criteria for defining an 
epoch: 1610. This date marks the irrevers- 
ible exchange of species following the 
collision of the Old and New worlds; an 
associated unusual drop in atmospheric 
CO 2 captured in Antarctic ice cores 
provides the GSSP marker (3). This date 
implicitly includes the impacts of 
farming, as the drop in CO^ largely 
resulted from vegetation regrowth on 
abandoned farmlands following the 
deaths of 50 million indigenous Americans 
(mostly from smallpox brought by 
Europeans). The annexing of the Americas 
by Europe was also an essential precursor 
to the Industrial Revolution and therefore 
captures associated later waves of envi- 
ronmental change (3). Choosing among 
alternative inception dates will be chal- 
lenging (5). 

Finally, Ruddiman and colleagues sug- 
gest using the term Anthropocene with a 
small “a” to denote its informality. This is 
not consistent with geological-community 
norms. Formal time-unit names have a 
capital letter: Anthropocene Epoch versus 
Anthropocene epoch for the informal term 
(i, 3). An elegant solution to referring to 
time before an agreed-upon Anthropocene 
Epoch is already available: the paleoan- 
thropocene (6). Geological norms should 
be followed when considering defining the ^ 

Anthropocene. ^ 
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Response 

CERTINI AND SCALENGHE and Lewis 
and Maslin seem convinced that geo- 
science must rely heavily on formal 
stratigraphic nomenclature to move 
forward. This view likely originates from 
the centuries-long effort to compile the 
relative geologic age sequence, which was 
a monumental achievement, especially 
given religious and social opposition. The 
compilation required careful attention to 
stratigraphic principles such as superposi- 
tion, index fossils, diachroneity and facies 
changes. Over time, the findings became 
formalized in geologic nomenclature. 

However, since the mid-1900s, geochem- 
ical (radiometric) dating has increasingly 
overshadowed (although not replaced) 
those time-honored principles. In my 
experience, most geoscience papers in 
recent decades initially refer to geologic 
nomenclature as a way of providing a 
familiar frame of reference, but they soon 
shift focus to the best geochemical dating 
available. Although my view will displease 
some geological colleagues, isn’t it time 
to acknowledge that almost all dating of 
geological sequences now takes place in 
the “geochemical age” (with a small “a”)? 

Lewis and Maslin propose that we 
designate a “Paleoanthropocene” (capital 
“P”). Should we place its start at the mass 
extinction of marsupials in Australia near 
50,000 years ago? Or should we choose the 
mass extinction of large mammals in the 
Americas almost 40,000 years later? Or, if 
we decide to pick a level during the 10,000 
years or more when crops and livestock 
were being domesticated and agriculture 
was spreading across the continents, 
which one time within that long span 
should we use for a formal designation? 
Could any choice be satisfying, given 
that it would have to ignore the rest of 
the long and rich history of early human 
influences? 

In contrast, informal use of these terms 
(anthropogenic, anthropocene, and paleo- 
anthropocene) would present no such 
problems or constraints. 
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ABSTRACTS 

Comment on “Sedimentary DNA from 
a submerged site reveals wheat in the 
British Isles 8000 years ago” 

K. D. Bennett 

Smith et al. (Reports, 27 February 2015, p. 
998) identify wheat DNA from an 
8000-calendar-years-before-the-present 
archaeological site in southern England and 
conclude that wheat was traded to Britain 
2000 years before the arrival of agriculture. 
The DNA samples are not dated, either 
directly or from circumstantial evidence, so 
there is no chronological evidence to support 
the claim. 

Full text at http://dx.doi.org/10.1126/science. 
aabl886 

Response to Comment on “Sedimentary 
DNA from a submerged site reveals 
wheat in the British Isles 8000 years ago” 

Oliver Smith, Garry Momher, Richard Bates, 
Paul Garwood, Simon Fitch, Mark Pollen, 

Vincent Gaffney, Robin G. Allaby 
Bennett questions the rigor of the dating of 
our sample from which sedimentary ancient 
DNA was obtained and the reliability of 
the taxonomic identification of wheat. We 
present a further radiocarbon date from 
S308 that confirms the lateral consistency 
of the palaeosol age. The suggestion of 
taxonomic false positives in our data 
illustrates a misinterpretation of the 
phylogenetic intersection analysis. 

Full text at http://dx.doi.org/10.1126/science. 
aab2062 



ERRATA 

Erratum for the Report “Mutation rate 
and genotype variation of Ebola virus from 
Mali case sequences” by T. Hoenen et al., 

Science 348, aac5674 (2015). Published online 
22 May 2015; 10. 1126/science. aac5674 

Erratum for the Report: “Molten uranium 
dioxide structure and dynamics” by L. B. 
Skinner ctaZ., Science 348, aab3869 (2015). 
Published online 1 May 2015; 10.1126/science. 
aab3869 

Correction for the Report: “The in vivo 
dynamics of antigenic variation in 
Trypanosoma hruceV* by M. R. Mugnier 
etal., Science 347, aaa4502 (2015). Published 
online 22 April 2015; 10.1126/science.aaa4502. 
Fig. 2A contained errors in the print version 
of Science. The correct figure is displayed 
online. 

Erratum for the Research Article: “Neu- 
rotransmitter switching in the adult brain 
regulates behavior” by D. Dulcis et al., 

Science 348, aab2338 (2015). Published online 
10 April 2015; 10.1126/science.aab2338 
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TECHNICAL COMMENT 



ARCHAEOLOGY 

Comment on “Sedimentary DNA from 
a submerged site reveals wheat in 
the British Isles 8000 years ago” 

K. D. Bennett^’^ 

Smith et al. (Reports, 27 February 2015, p. 998) identify wheat DNA from an 8000-calendar- 
years-before-the-present archaeological site in southern England and conclude that 
wheat was traded to Britain 2000 years before the arrival of agriculture. The DNA samples 
are not dated, either directly or from circumstantial evidence, so there is no chronological 
evidence to support the claim. 



C urrent understanding of the spread of ag- 
riculture across Europe indicates arrival in 
Britain about 6000 years ago, having taken 
about 2000 years to spread across central 
Europe and the English Channel from the 
Mediterranean (1). There is abundant evidence 
for the occurrence of early forms of wheat in Britain 
after 6000 calendar years before the present (cal 
yr B.P.), so the main claim of Smith et al.’s inves- 
tigations at Bouldnor Qiff southern England, is 
the early date of about 8000 cal yr B.P. (2), rather 
than the discovery of wheat itself. 

The samples examined for DNA (2) come from 
a monolith, S308, collected ‘from a location at the 
site” [p. 999 in (2)], or “in the proximity MS-08 
and MS-05, which were adjacent to each other” 
[p. 4 in the supplementary materials (SM) for (2)]. 
Nowhere in the Report is there any mention of a 
measurement between S308 and other monoliths, 
either horizontally or vertically. It is not clear how 
much time elapsed between the collection of the 
dated monoliths and S308. All the radiocarbon 
dating was carried out on monoliths other than 
S308 [table SI in the SM for (2)]. No information 
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is provided on other analyses that would provide 
a firm link between the DNA analyses of S308 
and any sample of the dated monoliths. Sedi- 
mentary analyses were carried out on the other 
monoliths (3) and might easily have been applied 
to S308 to provide a link even if, for some reason, 
it was not possible to obtain a radiocarbon date 
directly from the sediment matrix (within which 
the wheat DNA was obtained) of S308. 

The stratigraphic section for Bouldnor Cliff 
[figure IB in (2)] shows a complex series of coast- 
al sediments, dipping into a channel. This raises 
the possibility that samples taken later, even if 
at the same measured location (horizontal and 
vertical) as the originals, might have come from 
a stratigraphically different part of the sequence, 
depending on accuracy and precision of both 
horizontal and vertical measurements, as well as 
allowance for erosion between the dates of sam- 
ple collection. There are also inconsistencies in 
the presentation of the stratigraphy, including the 
relative extents of monoliths MS-04 and MS-07 
[separated by a gap in figure IB in (2) but con- 
tiguous in figure S2 in the SM for (2)]. Reference 
is made to a paleosol, which might incorporate 
material from a wide range of ages. 

The only evidence for wheat at the site comes 
from DNA. There are no wheat macrofossils or 
wheat pollen in the samples. The contention is 
that this indicates that the wheat did not come 
from nearby agriculture (2). However, whenever 



the wheat was incorporated in the sediment 
(from nearby agriculture or trading), it must have 
been associated with macrofossil remains, such 
as wheat grains. Both hazel and alder macrofos- 
sils (but no DNA) were found in or near S308 
(2), so it is curious that no trace of wheat re- 
mains have been found. There are inconsisten- 
cies in the DNA— even after the phylogenetic 
intersection analysis (PIA)— including the pres- 
ence of DNA from animals known in the mid- 
Holocene only from regions far removed from 
southern Britain, such as Ursus mantimus (polar 
bear, Arctic) and Cervus nippon (Sika deer. East 
Asia), and from tropical grasses (Panicoideae). 
These suggest that some false positives are coming 
through PIA or that the taxonomic precision in- 
dicated is inappropriate for the content of the 
database, which might be relevant to the age or 
origin of the wheat identification. 

Given that S308 contains DNA of wheat, how 
might it have arrived there? There are a number 
of possibilities. The samples may be contempo- 
raneous with the monoliths (2), so the wheat may 
have been traded from Europe, presumably all the 
way from the Mediterranean at 8000 cal yr B.P. 
(i). Alternatively, if the samples are actually much 
younger, the wheat may have been incorporated 
from nearby agriculture, perhaps from eroded 
soil, and ended up near to early Holocene sam- 
ples through the vagaries of sedimentation in a 
complex channel and coastal situation. The lat- 
ter is the more parsimonious view, in the absence 
of evidence to the contrary. Finally, the identifica- 
tion of the wheat DNA may be another false posi- 
tive in the comparisons with database DNA 

The claim that wheat was being traded to 
Britain at 8000 cal yr B.P. has substantial impli- 
cations for understanding of the archaeology of 
northwest Europe in the early postglacial pe- 
riod. Such a claim can be readily substantiated by 
dating the sediment matrix of the samples direct- 
ly. A claim based on undated samples, lacking 
any incontrovertible link to dated samples, is 
insufficient to overturn current understanding. 
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Response to Comment on 
“Sedimentary DNA from a submerged 
site reveals wheat in the British 
Isles 8000 years ago” 

Oliver Smith, ^ Garry Member,^ Richard Bates, ^ Paul Garwood,^ Simon Fitch,^ 

Mark Fallen,®* Vincent Gafftiey,^* Robin G. AIlaby^*t 

Bennett questions the rigor of the dating of our sample from which sedimentary 
ancient DNA was obtained and the reliability of the taxonomic identification of wheat. 
We present a further radiocarbon date from S308 that confirms the lateral consistency of 
the palaeosol age. The suggestion of taxonomic false positives in our data illustrates a 
misinterpretation of the phylogenetic intersection analysis. 



B ennett (i) raises the point that possible 
variances in the depositional environment 
make it inappropriate to apply radiocarbon 
dates from nearby monoliths MS-05 and 
MS-08 in our study {2), raising the pos- 
sibility that the wheat DNA we identified could 
possibly be younger than the 8000 years we 
claim. The evidence from the radiocarbon dates 
at the site and those we presented is that the 
palaeosol is of a consistent age over a 420-m area 
reaching between the sites of Bouldnor Cliff II, 
on which our study was based, and Bouldnor 
Cliff V (3). We therefore felt that the site was 
dated securely enough in this study. However, 
we accept that Bennett’s argument is most ro- 
bustly refuted by obtaining a radiocarbon date 
from the sample S308, which is a box sample 
taken from the cliff as outlined in the methods 
and not, as Bennett suggests, a monolith. We 
attempted to date both a twig from the sample 
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and the humic acid fraction of the sediment 
itself. Sediment dating is problematic because 
of the risk of inclusion of older carbon sources 
and can lead to overestimation of age. In this 
case, the humic acid content of the sandy clay was 
too low to provide a direct sediment date, but the 
twig returned an age of 7935 to 7790 calendar 
years before the present (Beta-406961), confirm- 
ing the age of the sample from which the sedi- 
mentary ancient DNA (sedaDNA) was obtained. 

Bennett asserts that the phylogenetic inter- 
section analysis (PIA) fails to filter out exotic 
species such as Ursus maritimus (polar bear), 
Cervus nippon (Sika deer), and tropical pani- 
coids, and as such casts doubt on the validity of 
the identity of the wheat DNA. This is incorrect 
and shows some misunderstanding of the DNA 
analysis. The analysis is shown to be highly ro- 
bust and does not in any way falsely identify 
polar bears, Sika deer, or tropical grasses as 
being present at the site. These are instances of 
closest match between sedaDNA reads and the 
database; it would be naive to interpret these 
as a species identification, for reasons we ex- 
plained in some depth in the supplementary 
materials of {2). These are not instances of reads 
that have been filtered out by the PIA, but rather 
meet the criteria of the analysis. The PIA is pred- 
icated on the fact that, due to variable data- 
base representation, the species of origin may 
not be present in the database; indeed, often, 
large taxonomic orders are represented by only 
a very few species for many genomic regions. 
The robustness of the analysis comes from 



examining the phylogenetic range of similar DNAs 
within a database. In fact, Bennett is alluding to 
sedaDNA that is attributed robustly to Carnivora, 
Caniformia, or Ursidae, in the case of the “polar 
bear,” meaning that the true species of origin 
lies somewhere in the phylogenetic range en- 
compassed by this order, suborder, or family, 
respectively. Similarly, cervinae is identified in 
the case of the “Sika deer,” and various uncon- 
troversial higher taxonomic orders of grasses are 
identified in the analysis in which the closest data- 
base match is from a tropical grass such as rice. 

Factors such as genome size and how well 
genomes are represented in databases have a 
great influence on the resulting frequency in 
the DNA profile. Wheat has a large and well- 
characterized genome, and, furthermore, many 
wheat species and the sister species Aegilops 
have been characterized. This means that we 
have a great power to detect wheat relative to 
other organisms from metagenomic profiles 
using the PIA, which is reflected in the relatively 
large number of wheat sequences we identify. 
Given the high level of accuracy (81%) of the 
analysis, the evidence for the presence of wheat 
at Bouldnor is overwhelming. 

Finally, Bennett asserts that sedaDNA re- 
quires the existence of macrofossil sources of 
DNA. The sandy clay from which we took the 
sedaDNA sample was largely devoid of macro- 
fossils save for a few twigs, with a low organic 
content, despite the large number of taxonomic 
orders that were identified. Bouldnor Cliff has 
not been extensively excavated yet, and it is pos- 
sible that the quantities of grain involved may 
be very small and that such wheat macrofossils 
remain to be discovered. However, the presence 
of sedaDNA in the absence of macrofossils is 
precedented (4). We can make no conclusion 
about the source of the DNA without further 
evidence. However, if rapid transport occurred— 
such as might be expected, for instance, using 
boats associated with pioneer groups on the 
western coast of France— then the source may 
even have been flour rather than grain. 
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Illustration of how a virtual 
interviewer sees a human. 

The interviewer assesses the 
psychological state of the 
human by tracking and analyzing 
their facial expressions, body 
posture, and speech. 
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SPECIAL SECTION 



RISEOFTHE 

MACHINES 

By Jelena Stajic, Richard Stone, Gilbert Chin, and Brad Wible 



A lthough most would agree that the average person is smarter 
than the average cat, comparing humans and machines is 
not as straightforward. A computer may not excel at abstract 
reasoning, but it can process vast amounts of data in the blink 
of an eye. In recent years, researchers in artificial intelligence 
(AI) have used this computational firepower on the scads of data 
accumulating online, in academic research, in financial records, 
and in virtually all walks of life. The algorithms they develop 
help machines learn from data and apply that knowledge in 
new situations, much like humans do. The ability of computers to extract 
personal information from seemingly innocuous data raises privacy 
concerns. Yet many AJ systems indisputably improve our lives; for 
example, by making communication easier through machine translation, 
by helping diagnose illness, and by providing modern comforts, such as 
your smartphone acting as your personal assistant. This special issue 
presents a survey of the remarkable progress made in AI and outlines 
challenges lying ahead. 

Many AJ systems are designed for narrow applications, such as playing 
chess, flying a jet, or trading stocks. AJ researchers also have a grander 
aspiration: to create a well-rounded and thus more humanlike intelligent 
agent. Scaling that research peak is daunting. But triumphs in the field 
of AI are bringing to the fore questions that, until recently, seemed better 
left to science fiction than to science: How will we ensure that the rise of 
the machines is entirely under human control? And what will the world 
be like if truly intelligent computers come to coexist with humankind? 
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The synthetic therapist 

Some people prefer to bare their souls to computers 
rather than to fellow humans 



P eople have always noticed Yrsa 
Sverrisdottir. First it was ballet, 
which she performed intensively 
while growing up in Iceland. Then 
it was science, which she excelled 
at and which brought her to the 
stage at conferences. And starting 
in 2010, when she moved to the 
University of Oxford in the United 
Kingdom to study the neurophysiology of the 
heart, it was her appearance. With her Nor- 
dic features framed by radiant blonde hair, “I 
just stand out here,” she says. “I can’t help it.” 
After she arrived in the United Kingdom, 
she found that she no longer enjoyed the 
attention. She began to feel uncomfortable 
in crowds. Her relationships suffered. There 
had been some obvious stressors, such as 
the deaths of both of her parents. But the 
unease didn’t let up. By 2012, she says, “I felt 
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like I was losing control.” Then she met Fjola 
Helgadottir, one of the few other Icelanders 
in town. Helgadottir, a clinical psychology 
researcher at Oxford, had created a com- 
puter program to help people identify and 
manage psychological problems on their 
own. Sverrisdottir decided to give it a try. 

The program, based on a technique 
called cognitive behavioral therapy (CBT) 
and dubbed CBTpsych, begins with sev- 
eral days of interactive questioning. “It 
was exhausting,” Sverrisdottir says. The 
interrogation started easily enough with 
basic personal details, but then began to 
probe more deeply. Months of back and 
forth followed as the program forced her 
to examine her anxieties and to identify 
distressing thoughts. CBTpsych diagnosed 
her as having social anxiety, and the in- 
sight rang true. Deep down, Sverrisdottir 



realized, “I didn’t want people to see me.” 

Then the program assumed the role of 
full-fledged therapist, guiding her through 
a regimen of real-world exercises for tak- 
ing control. It sounds like a typical success 
story for clinical psychology. But no human 
psychologist was involved. 

CBTpsych is far from the only computer- 
ized psychotherapy tool available, nor the | 
most sophisticated. Ellie, a system built at § 
the University of Southern California (USC) | 
in Los Angeles, uses artificial intelligence ^ 
(AI) and virtual reality to break down barri- § 
ers between computers and humans. Origi- | 
nally funded by the US. military, its focus 2 
is on diagnosing and treating psychological % 
trauma. Because patients interact with a | 
digital system, the project is generating a § 
rare trove of data about psychotherapy itself. | 
The aim, says Albert “Skip” Rizzo, the USC I 
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psychologist who leads the effort, is nothing 
short of “dragging clinical psychology kick- 
ing and screaming into the 21st century.” 

A 19 June editorial in The New York Times 
deemed computerized psychotherapy “ef- 
fective against an astonishing variety of 
disorders.” The penetration of the Internet 
into far-flung communities could also bring 
mental health treatment to vast numbers of 
people who otherwise have no access. 

But whether clinical psychologists will 
accept AI into their practice is uncertain. 
Nor is it clear that the tools of AI can carry 
computerized psychotherapy beyond its so 
far limited capacity, says Selmer Bringsjord, 
a cognitive scientist and AI researcher at 
Rensselaer Polytechnic Institute in Troy, New 
York. “It is incredibly ambitious.” 

ALL OF TODAY’S VIRTUAL psychologists 
trace their origins to ELIZA, a computer pro- 
gram created half a century ago. Named after 
the young woman in Pygmalion who rapidly 
acquires sophisticated language, ELIZA was 
nothing more than a few thousand lines of 
code written by Joseph Weizenbaum and 
other computer scientists at the Massachu- 
setts Institute of Technology 
(MIT) in the early 1960s to study 
human-computer interaction. 

ELIZA followed rules that de- 
termined how to respond during 
a dialogue. The most convinc- 
ing results came from a rule set 
called DOCTOR that simulated a 
psychotherapist: By turning patients’ state- 
ments around as questions, the program 
coaxed them to do most of the talking. For 
instance, in response to a patient sa 5 ung, “I 
feel helpless,” the computer might respond, 
“Why do you think you feel that way?” (You 
can talk to ELIZA yourself at http://psych.ful- 
Ierton.edu/mbirnbaum/psychl01/EIiza.htm.) 

People engaged readily with ELIZA, 
perhaps more for its novelty than its con- 
versational skills, but AI researchers were 
unimpressed. “The idea that you could make 
a convincing AI system that didn’t really 
have any intelligence was seen as cheating,” 
says Terry Winograd, a computer scientist at 
Stanford University in Palo Alto, California, 
who was a Ph.D. student down the hall from 
Weizenbaum. This was a wildly optimistic 
time for the field, with many researchers an- 
ticipating computers with human-level gen- 
eral intelligence right around the corner. 

But work on artificial general intelligence 
didn’t pan out, and funding and interest 
dried up in what has come to be known as 
the “AI winter.” It wasn’t until the turn of the 
new millennium that mainstream interest in 
AI resurged, driven by advances in “narrow 
AI,” focusing on specific problems such as 
voice recognition and machine vision. 

SCIENCE sciencemag.org 



Conversational “chatbots” such as ELIZA 
are still viewed as a parlor trick by most com- 
puter scientists {Science, 9 January, p. 116). 
But the chatbots are finding a new niche in 
clinical psychology. Their success may hinge 
on the very thing that AI researchers eschew: 
the ability of an unintelligent computer to 
trick people into believing that they are talk- 
ing to an intelligent, empathetic person. 

THAT ISN’T EASY, as Rizzo is keenly aware. 
What most often breaks the spell for a pa- 
tient conversing with Ellie isn’t the content 
of the conversation, because the computer 
hews closely to a script that Rizzo’s team 
based on traditional clinical therapy ses- 
sions. “The problem is entrainment,” he 
says, referring to the way that humans sub- 
consciously track and mirror each other’s 
emotions during a conversation. 

For example, a patient might say to 
Ellie, “Today was not the best day,” but the 
voice recognition software misses the 
“not.” So Ellie smiles and exclaims, “That’s 
great!” For an AI system striving to bond 
with a human patient and earn trust, Rizzo 
says, “that’s a disaster.” 



To improve entrainment, a camera tracks a 
patient’s psychological signals: facial expres- 
sion, posture, hand movement, and voice 
dynamics. Ellie crunches those data in an at- 
tempt to gauge emotional state. 

The patterns can be subtle, says Louis- 
Philippe Morency a computer scientist at 
use who has led the development of the AI 
that underlies Ellie. For instance, he says, 
a person’s voice may shift “from breathy 
to tense.” The team devised algorithms to 
match patterns to a likely emotional state. 
It’s imperfect, he says, but “our experiments 
showed strong correlation with [a patient’s] 
psychological distress level.” 

Other patterns unfold over multiple ses- 
sions. For instance, the team’s work with 
US. veterans suffering from post-traumatic 
stress disorder (PTSD) revealed that “smile 
dynamics” are a strong predictor of depres- 
sion. The pattern is so subtle that it took 
a computer to detect it: Smiling frequency 
remained the same in depressed patients, 
on average, but the duration and intensity 
of their smiles was reduced. 

Even if Ellie were to achieve perfect en- 
trainment, Rizzo says, it “is really just an en- 
hanced ELIZA.” The AI under the hood can 
only sustain about a 20-minute conversation 



before the spell breaks, which limits the sys- 
tem’s usefulness for diagnosis and treatment 
of most psychological problems. Without 
sophisticated natural language processing 
and semantic knowledge, Ellie will never 
fool people into believing that they are talk- 
ing to a human. But that’s okay, Rizzo says: 
Becoming too humanlike might backfire. 
One counterintuitive finding from Rizzo’s lab 
came from telling some patients that Ellie is 
a puppet controlled by a human while telling 
others she is fully autonomous. The patients 
told there was a puppeteer were less engaged 
and less willing to open up during therapy. 

That’s no surprise to AI researchers 
like Winograd. “This goes right back to 
ELIZA,” he says. “If you don’t feel judged, 
you open up.” 

Ethical and privacy issues may loom if AI 
therapy goes mainstream. Winograd worries 
that online services may not be forthcoming 
about whether there is a human in the loop. 
“There is a place for deceiving people for 
their own good, such as using placebos in 
medicine,” he says. But when it comes to 
AI psychology, “you have to make it clear to 
people that they are talking to a machine 
and not a human.” 

If patients readily open up 
to a machine, will clinicians be 
needed at all? Rizzo is adamant 
that a human must always be 
involved because machines can- 
not genuinely empathize with 
patients. And Ellie, he points 
out, has a long way to go before being ready 
for prime time: The program does not yet 
have the ability to learn from individual pa- 
tients. Rizzo envisions AI systems as a way 
to gather baseline data, providing psycholo- 
gists with the equivalent of a standard bat- 
tery of blood tests. “The goal isn’t to replace 
people,” he says, “but to create tools for hu- 
man caregivers.” 

Helgadottir has a bolder vision. Although 
computers are not going to replace thera- 
pists anytime soon, she says, “I do believe 
that in some circumstances computerized 
therapy can be successful with no human 
intervention ... in many ways people are not 
well suited to be therapists.” A computer 
may be more probing and objective. 

Sverrisdottir’s experience suggests that 
CBTpsych, at least, can make a difference. 
Under the program’s tutelage, she says, 
“very slowly I started to analyze myself 
when I’m amongst other people.” She identi- 
fied a pattern of “negative thoughts about 
people judging me.” 

She might have got there with a human 
therapist, she says. But in the years since she 
first started talking to a computer about the 
trouble swirling in her mind, Sverrisdotter 
says, “I have been able to change it.” ■ 
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The goal is ‘‘dragging clinical psychology 
kicking and screaming into the 21st century” 

Albert “Skip” Rizzo, University of Southern California 
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Fears of an Al pioneer 

Stuart Russell argues that AI is as dangerous 
as nuclear weapons By John Bohannon 



F rom the enraged robots in the 1920 
play R.tZR. to the homicidal computer 
H.A.L. in 2001: A Space Odyssey, sci- 
ence fiction writers have embraced the 
dark side of artificial intelligence (AI) 
ever since the concept entered our col- 
lective imagination. Sluggish progress in AI 
research, especially during the “AI winter” 
of the 1970s and 1980s, made such worries 
seem far-fetched. But recent breakthroughs 
in machine learning and vast improvements 
in computational power have brought a flood 
of research funding— 
and fresh concerns 
about where AI may 
lead us. 

One researcher now 
speaking up is Stuart 
Russell, a computer 
scientist at the Uni- 
versity of California, 
Berkeley, who with 
Peter Norvig, director 
of research at Google, wrote the premier AI 
textbook. Artificial Intelligence: A Modem 
Approach, now in its third edition. Last year, 
Russell joined the Centre for the Study of Ex- 
istential Risk at Cambridge University in the 
United Kingdom as an AI expert focusing on 
“risks that could lead to human extinction.” 
Among his chief concerns, which he aired at 
an April meeting in Geneva, Switzerland, run 
by the United Nations, is the danger of put- 
ting military drones and weaponry under the 
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full control of AI systems. This interview has 
been edited for clarity and brevity. 

Q: What do you see as a likely 
path from AI to disaster? 

A: The basic scenario is explicit or implicit 
value misalignment: AI systems [that are] 
given objectives that don’t take into ac- 
count all the elements that humans care 
about. The routes could be varied and 
complex— corporations seeking a super- 
technological advantage, countries tr 3 dng to 
build [AI systems] before their enemies, or 
a slow-boiled frog kind of evolution leading 
to dependency and enfeeblement not un- 
like E. M. Forster’s The Machine Stops. 

Q: You’ve grappled with this 
Issue for a long time. 

A: My textbook has a section “What If We 
Do Succeed?” devoted to the question of 
whether human-level AI or superintelligent 
systems would be a good idea. [More recent 
causes for concern are] the rapid develop- 
ments in AI capabilities such as the legged 
locomotion of Big Dog [the autonomous 
robot created by Boston Dynamics, recently 
acquired by Google] and progress in com- 
puter vision. 

Q: What needs to be done to 
prevent an AI catastrophe? 

A: First, research into the precise nature 
of the potential risk and development of 



technical approaches to eliminate the risk. 
Second, modification of the goals of AI and 
the training of students so that alignment 
of AI systems with human objectives is 
central to the field, just as containment is 
central to the goals of fusion research. 

Q: But by the time we were developing 
nuclear fusion, we already had atomic 
bombs, whereas the AI threat seems 
speculative. Are we really at the “fusion” 
stage of AI research? 

A: Here’s what Leo Szilard wrote in 1939 
after demonstrating a [nuclear] chain 
reaction: “We switched everything off and 
went home. That night, there was very 
little doubt in my mind that the world 
was headed for grief.” To those who say, 
well, we may never get to human-level or 
superintelligent AI, I would reply: It’s like 
driving straight toward a cliff and saying, 
“Let’s hope I run out of gas soon!” 

Q: The Intention with fission was to create 
a weapon. The Intention with AI Is to create 
a tool: Intelligence on tap. Does that explain 
the reluctance to regulate AI? 

A: From the beginning, the primary inter- 
est in nuclear technology was the “inex- 
haustible supply of energy.” The possibility 
of weapons was also obvious. I think there 
is a reasonable analogy between unlimited 
amounts of energy and unlimited amounts 
of intelligence. Both seem wonderful until 
one thinks of the possible risks. In neither 
case will anyone regulate the mathemat- 
ics. The regulation of nuclear weapons 
deals with objects and materials, whereas 
with AI it will be a bewildering variety of 
software that we cannot yet describe. I’m 
not aware of any large movement calling 
for regulation either inside or outside AI, 
because we don’t know how to write such 
regulation. 

Q: Should we start tracking AI research 
as we track fissile material? Who should 
do the policing? 

A: I think the right approach is to build 
the issue directly into how practitioners 
define what they do. No one in civil 
engineering talks about “building bridges 
that don’t fall down.” They just call it 
“building bridges.” Essentially all fusion 
researchers work on containment as 
a matter of course; uncontained fusion 
reactions just aren’t useful. Right now 
we have to say “AI that is probably 
beneficial,” but eventually that will just 
be called “AL” [We must] redirect the 
field away from its current goal of building 
pure intelligence for its own sake, regard- 
less of the associated objectives and their 
consequences. ■ 
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Data, privacy, and the greater good 

Eric Horvitz^* and Deirdre Mulligan^* 

Large-scale aggregate analyses of anonymized data can yield valuable results and insights that 
address public health challenges and provide new avenues for scientific discovery. These methods 
can extend our knowledge and provide new tools for enhancing health and wellbeing. However, they 
raise questions about how to best address potential threats to privacy while reaping benefits for 
individuals and to society as a whole. The use of machine learning to make leaps across informational 
and social contexts to infer health conditions and risks from nonmedical data provides 
representative scenarios for reflections on directions with balancing innovation and regulation. 



W hat if analyzing Twitter tweets or Face- 
book posts could identify new mothers 
at risk for postpartum depression (PPD)? 
Despite PPD’s serious consequences, early 
identification and prevention remain 
difficult. Absent a history of depression, detection 
is largely dependent on new mothers’ self-reports. 
But researchers found that shifts in sets of activ- 
ities and language usage on Facebook are pre- 
dictors of PPD {!) (see the photo). This is but one 
example of promising research that uses machine 
learning to derive and leverage health-related in- 
ferences from the massive flows of data about in- 
dividuals and populations generated through social 
media and other digital data streams. At the same 
time, machine learning presents new challenges 
for protecting individual priva(y and ensuring fair 
use of data. We need to strike a new balance be- 
tween controls on collecting information and con- 
trols on how it is used, as well as pursue auditable 
and accountable technologies and systems that 
facilitate greater use-based privacy protections. 

Researchers have coined terms, such as digi- 
tal disease detection (2) and infodemiology {3), 
to define the new science of harnessing diverse 
streams of digital information to inform public 
health and policy, e.g., earlier identification of epi- 
demics, (4) modeling communicability and flow of 
illness (5), and stratifying individuals at risk for 
illness (d). This new form of health research can 
also inform and extend understandings drawn from 
traditional health records and human subjects re- 
search. For example, the detection of adverse drug 
reactions could be improved by jointly leveraging 
data from the U.S. Food and Drug Administration’s 
Adverse Event Reporting System and anonymized 
search logs (7). Search logs can serve as a large- 
scale sensing system that can be used for drug 
safety surveillance— pharmacovigilance. 

Infodemiology studies are fypically large-scale 
aggregate analyses of anonymized data— publicly 
disclosed or privately held— that yield results and 
insights on public health questions across popula- 
tions. However, some methods and models can be 
< aimed at making inferences about unique individ- 
I uals that could drive actions, such as alerting or 
^ providing digital nudges, to improve individual 
I or public health outcomes. 

^ 
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Although digital nudging shows promise, a re- 
cent flare-up in the United Kingdom highlights 
the privacy concerns it can ignite. A Twitter suicide- 
prevention application called Good Samaritan 
monitored individuals’ tweets for words and 
phrases indicating a potential mental health crisis. 
The app notified the person’s followers so they 



could intervene to avert a potential suicide. But 
the app was shuttered after public outcry drew reg- 
ulator concern (5). Critics worried the app would 
encourage online stalkers and bullies to target vul- 
nerable individuals and collected 1200 signatures 
on a petition arguing that the app breached users’ 
privacy by collecting, processing, and sharing sen- 
sitive information. Despite the developers’ laudable 
goal of preventing suicide, the nonprofit was chas- 
tised for playing fast and loose with the privacy and 
mental health of those it was seeking to save {9). 

Machine learning can facilitate leaps across infor- 
mational and social contexts, making “category- 
jumping” inferences about health conditions or 
propensities from nonmedical data generated far 
outside the medical context. The implications for 
privacy are profound. Category-jumping inferences 
may reveal attributes or conditions an individual has 
specifically withheld from others. To protect against 
such violations, the United States heavily regulates 
health care privacy. But, although information about 
health conditions garnered from health care treat- 
ment and payment must be handled in a manner 
that respects patient privacy, machine learning and in- 
ference can sidestep many of the existing protections. 



Even when not category-jumping, machine learning 
can be used to draw powerftil and compromising in- 
ferences fixrm self-disclosed, seemingly benign data or 
readily observed behavior. These inferences can un- 
dermine a basic goal of many privacy laws— to allow 
individuals to control who knows what about them. 
Machine learning and inference makes it increasingly 
difficult for individuals to understand what others can 
know about them based on what they have explicitly 
or implicitly shared. And these computer-generated 
channels of information about health conditions 
join other technically created fissures in existing legal 
protections for health privacy. In particular, it is dif- 
ficult to reliably deidentify publicly shared data sets, 
given the enormous amount and variety of ancillaiy 
data that can be used to reidentify individuals. 

The capacities of machine learning expose the fim- 
damental limitations of existing US. privacy rules that 
tie the privacy protection of an individual’s health sta- 
tus to specific contexts or specific types of informa- 
tion a priori identified as health informatioa Health 
privacy regulations and privacy laws in the United 
States generally are based on the assumption 
that the semantics of data are relatively fixed 
and knowable and reside in isolated contexts. 
Machine learning techniques can instead be used 
to infer new meaning within and across contexts 
and is generally unencumbered by privacy rules 
in the United States. Using publicly available 
Twitter posts to infer risk of PPD, for example, 
does not run afoul of existing privacy law. This 
might be unsurprising, and seem unproblematic, 
given that the posts were publicly shared, but 
there are troubling consequences. 

Current privacy laws often do double duty. At a 
basic level, they limit who has access to infor- 
mation about a person. This implicitly limits the 
extent to which that information influences 
decision-making and thus doubles as a limit on 
the opportunities for information to fiiel dis- 
crimination. Because of the heightened privacy 
sensitivities and concerns with health-related dis- 
crimination, we have additional laws that regulate 
the use of health information outside the health care 
context US. laws specifically limit the use of some 
health information in ways considered unfeir. For 
example, credit-reporting agencies are generally pro- 
hibited fix)m providing medical information to make 
decisions about employment, credit, or housing. The 
Americans with Disabilities Act (ADA) prohibits dis- 
crimination on the basis of substantial physical or 
mental disabilities— or even a mistaken belief that an 
individual suffers fix)m such a disability. If machine 
learning is used to infer that an individual suffers 
fix)m a physical or mental impairment, an employer 
who bases a hiring decision on it, even if the in- 
ference is wrong, would violate the law. 

But the ADA does not prohibit discrimination based 
on predispositions for such disabilities (10). Machine 
learning might discover those, too. In theory the 
Genetic Information Non-Discriiriination Act (GINA) 
should fill this gap by protecting people genetically 
predisposed to a disease. But again, machine learn- 
ing exposes cracks in this protection. Although 
GINA prohibits discriiriination based on informa- 
tion derived from genetic tests or a family history of 
a disease (11), it does not limit the use of information 




Machine learning can make “category-jumping” infer- 
ences about health. New mother’s activities and language 
usage on social media are predictors of postpartum 
depression. 
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about such a disposition— even if it is grounded in 
genetics— inferred through machine learning tech- 
niques that mine other sorts of data. In other words, 
machine learning that predicts fiiture health status 
from nongenetic information— including health status 
changes due to genetic predisposition— would cir- 
cumvent existing legal protections Q2). 

Just as machine learning can expose secrets, it 
facilitates social sorting— placing individuals into 
categories for differential treatment— with good or 
bad intent and positive or negative outcomes. The 
methods used to classify individuals as part of ben- 
eficial public health programs and nudges can just 
as easily be used for more nefarious purposes, such 
as discrimination to protect organizational profits. 

Policy-makers in the United States and else- 
where are just beginning to address the challenges 
that machine learning and inference pose to com- 
mitments to privacy and equal treatment. Although 
not specifically focused on health information, 
reports issued by the White House— discuss the 
potential for large-scale data analyses to result in 
discrimination (i3)— and the Federal Trade Com- 
mission (FTC) have suggested new efforts to pro- 
tect privacy, regulate harmful uses of information, 
and increase transparency. 

The FTC is the key agency policing unfair and 
deceptive practices in the commercial marketplace, 
including those that touch on the privacy and se- 
curity of personal information. Its proposed privacy 
framework encourages companies to combine tech- 
nical and policy mechanisms to protect against re- 
identification. The FTC’s proposed rules would 
work to ensure that data are both “not reasonably 
identifiable” and accompanied by public company 
commitments not to reidentify it. The same pri- 
vacy rules should apply to downstream users of the 
data {14). This approach is promising for machine 
learning and other areas of artificial intelligence 
that rely on data-centric analyses. It allows learning 
from laige data sets— and sharing them— by encour- 
aging companies to reduce the risks that data pcx)ls 
and data sharing pose for individual privacy. 

The FTC proposal grows, in part, fix>m recent agen- 
cy actions focused on inferences that we have deemed 
“context-jumping.” hi one hi^-profile case, Netflix 
publicly released data sets to support a competition 
to improve their recommendation algorithm. When 
outside researchers used ancillaiy data to reidenttfy 
and infer sensitive attributes about individuals from 
the Netflix data sets, the FTC worked with the com- 
pany to limit fiiture public disclosures— setting out 
the limits discussed above. In a similar vein, the FTC 
objected to a change in Facebook’s defaults that ex- 
posed individuals’ group affiliations from which 
sensitive information, such as political views and 
sexual orientation, could be inferred {15). 

Additionally, the FTC has made efforts to ensure 
that individuals can control tracking in the online 
and mobile environments. These are in part due to 
the nonobvious inferences that can be drawn from 
vast collections of data {16-18) and the subsequent 
risks to consumers, who may be placed in classif- 
ications that single them out for specific treatment 
in the marketplace {19, 20). In a related context, 
the FTC recommended that Congress require data 
brokers— companies that collect consumers’ personal 



information and resell or share that information 
with others— to clearly disclose to consumers in- 
formation about the data they collect, as well as 
the fact that they derive inferences from it {21). 
Here, too, the FTC appears concerned with not just 
the raw data, but inferences firom its analysis. 

The Obama Administration’s Big Data Initiative 
has also considered the risks to privacy posed by 
machine learning and the potential downsides of 
using machine inferences in the commercial mar- 
ketplace {22, 23), concluding that we need to up- 
date our privacy rules, increase technical expertise 
in consumer protection and civil rights agencies to 
address novel discrimination issues arising from 
big data, provide individuals with privacy preserv- 
ing tools that allow the to control the collection and 
manage the use of personal information, as well as 
increase transparency into how companies use and 
trade data. The Admimstration is also concerned 
with the use of machine learning in policing and 
national security. The White House report called for 
increased technical expertise to help civil rights and 
consumer protection agencies identify, investigate, 
and resolve uses of big data analytics that have a 
discriminatoiy impact on protected classes {24). 

Note that reports and proposals from the Ad- 
ministration distinctly emphasize policies and reg- 
ulations focused on data use rather than collection. 
While acknowledging the need for tools that allow 
consumers to control when and how their data is 
collected, the Administration recommendations fo- 
cus on empowering individuals to participate in 
decisions about fiiture uses and disclosures of col- 
lected data {25). A separate report by the President’s 
Council of Advisors on Science and Technology 
(PCAST) concluded that this was a more ftuitftil di- 
rection for technical protections. Both reports sug- 
gest that use-based protections better address the 
latent meaning of data— inferences drawn from 
data using machine learning— and can adapt to the 
scale of the data-rich and connected environment of 
the future {26). The Administration called for col- 
laborative efforts to ensure that regulations in the 
health context will allow society to reap the benefits 
and mitigate the risks posed by machine learning 
and inferences. Use-based approaches are often fav- 
ored by industry, as well, which tends to view data 
as akin to a natural resource to be mined for com- 
mercial and public benefit, and industry is resistant 
to efforts to constrain data collection. 

Although incomplete and unlikely to be acted 
upon by the current gridlocked Congress, adop- 
tion of these recommendations would increase 
transparency about data’s collection, use, and con- 
sequences. Along with efforts to identify and con- 
strain discriminatory or unfair uses of data and 
inferences, they are promising steps. They also align 
with aspects of existing European privacy laws 
concerned with the transparency and fairness of 
data processing, particularly the risks to indi- 
viduals of purely automated decision-making. 

Current European Union (EU) law requires en- 
tities to provide individuals with access to the data 
on which decisions are rendered, as well as infor- 
mation about decision criteria [see Articles 12 and 
15 of {2T)\. Although currently governed by a Europe- 
wide directive, both provisions are a matter of na- 



tional law. What exactly individuals receive when 
they request access to their data and to processing 
logic varies by country, as does the implementation 
of the limitation on “purely automated” processing. 
The EU is expected to adopt a data privary regula- 
tion that will supplant local law, with a single na- 
tional standard. Although the current draft includes 
parallel provisions, their final form is not yet known 
nor is how they will ultimately be interpreted {27). 
In theory, a new EU requirement to disclose the 
logic of processing could apply quite broadly, with 
implications for public access to data analytics and 
algorithms. In the interim, a decision expected this 
summer in a case before the European Court of 
Justice may provide some detail as to what level 
of access to both data and the logic of processing is 
currently required under the EU Directive {28). 

Improving the transparency of data processing 
to data subjects is both important and challenging. 
Although the goal maybe to promote actual under- 
standing of the workings or likely outputs of ma- 
chine learning and reasoning methods, the workflows 
and dynamism of algorithms and decision criteria 
may be difficult to characterize and explain. For ex- 
ample, popular convolutional neural-network learn- 
ing procedures (commonly referred to as “deep 
learning”) automatically induce rich, multilayered 
representations that their developers themselves 
may not understand with clarity. Although high- 
level descriptions of procedures and representations 
might be provided, even an accomplished program- 
mer with access to the source code would be un- 
able to describe the precise operation of such a 
system or predict the output of a given set of inputs. 

Data’s meaning has become a moving target. 
Data sets can be easily combined to reidentify data 
sets thought deidentified, and sensitive knowledge 
can be inferred from benign data that are routine- 
ly and promiscuously shared. These pose difficul- 
ties for current U.S. legal approaches to privacy 
protection that regulate data on the basis of its 
identifiability and express meaning. 

Use-based approaches are driven, in part, by the 
realization that focusing solely on limiting data col- 
lection is inadequate. In a way, this presupposes 
that data are an unalloyed good that should be 
collected on principle, whenever and vdierever pos- 
sible. Whereas we are not ready to abandon limits on 
data collection, we agree that use-based regulations, 
although challenging to implement, are an impor- 
tant part of the fiiture legal landscape— and will 
help to advance privacy, equality, and the public 
good. To advance transparency and to balance the 
constraints they impose, use-based approaches would 
need to emphasize access, accuracy, and correction 
rights for individuals. 

The evolution of regulations for health informa- 
tion, although incomplete, provides a usefiil map for 
thinking about the challenges and opportunities we 
face today and frames potential solutions. In health 
care, privacy rules were joined by nonciiscrimination 
rules and always were accompanied by special pro- 
visions to support research. Today, they are being 
joined by collective governance models designed 
to encourage pooling of data in biobanks that sup- 
port research on health conditions while protecting 
collective interests in privacy. 
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Despite practical challenges, we are hopefiil that 
informed discussions among policy-makers and the 
public about data and the capabilities of machine 
learning, will lead to insightful designs of programs 
and policies that can balance the goals of protecting 
privacy and ensuring fairness with those of reaping 
the benefits to scientific research and to individual 
and public health. Our commitments to privacy and 
fairness are evergreen, but our policy choices must 
adapt to advance them, and support new tech- 
niques for deepening our knowledge. 
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perspectives, and prospects 
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Machine learning addresses the question of how to build computers that improve 
automatically through experience. It is one of today’s most rapidly growing technical fields, 
lying at the intersection of computer science and statistics, and at the core of artificial 
intelligence and data science. Recent progress in machine learning has been driven both by 
the development of new learning algorithms and theory and by the ongoing explosion in the 
availability of online data and low-cost computation. The adoption of data-intensive 
machine-learning methods can be found throughout science, technology and commerce, 
leading to more evidence-based decision-making across many walks of life, including 
health care, manufacturing, education, financial modeling, policing, and marketing. 



M achine learning is a discipline focused 
on two interrelated questions: How can 
one construct computer systems that auto- 
matically improve through experience? 
and What are the fundamental statistical- 
computational-information-theoretic laws that 
govern all learning systems, including computers, 
humans, and organizations? The study of machine 
learning is important both for addressing these 
fundamental scientific and engineering ques- 
tions and for the highly practical computer soft- 
ware it has produced and fielded across many 
applications. 

Machine learning has progressed dramati- 
cally over the past two decades, from laboratory 
curiosity to a practical technology in widespread 
commercial use. Within artificial intelligence (AI), 
machine learning has emerged as the method 
of choice for developing practical software for 
computer vision, speech recognition, natural lan- 
guage processing, robot control, and other ap- 
plications. Many developers of AI systems now 
recognize that, for many applications, it can be 
far easier to train a system by showing it exam- 
ples of desired input-output behavior than to 
program it manually by anticipating the desired 
response for aH possible inputs. The effect of ma- 
chine learning has also been felt broadly across 
computer science and across a range of indus- 
tries concerned with data-intensive issues, such 
as consumer services, the diagnosis of faults in 
complex systems, and the control of logistics 
chains. There has been a similarly broad range of 
effects across empirical sciences, from biology to 
cosmology to social science, as machine-learning 
methods have been developed to analyze high- 
throughput experimental data in novel ways. See 
Fig. 1 for a depiction of some recent areas of ap- 
plication of machine learning. 

A learning problem can be defined as the 
problem of improving some measure of perform- 
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ance when executing some task, through some 
type of training experience. For example, in learn- 
ing to detect credit-card fraud, the task is to as- 
sign a label of ‘Traud” or “not fraud” to any given 
credit-card transaction. The performance metric 
to be improved might be the accuracy of this 
fraud classifier, and the training experience might 
consist of a collection of historical credit-card 
transactions, each labeled in retrospect as fraud- 
ulent or not. Alternatively, one might define a 
different performance metric that assigns a higher 
penalty when ‘fraud” is labeled “not fraud” than 
when “not fraud” is incorrectly labeled “fraud.” 
One might also define a different type of training 
experience— for example, by including unlab- 
eled credit-card transactions along with labeled 
examples. 

A diverse array of machine-learning algorithms 
has been developed to cover the wide variety of 
data and problem types exhibited across differ- 
ent machine-learning problems (i, 2). Conceptual- 
ly, machine-learning algorithms can be viewed as 
searching through a large space of candidate 
programs, guided by training experience, to find 
a program that optimizes the performance metric. 
Machine-learning algorithms vaiy greatly, in part 
by the way in which they represent candidate 
programs (e.g., decision trees, mathematical func- 
tions, and general programming languages) and in 
part by the way in which they search through this 
space of programs (e.g., optimization algorithms 
with well-understood convergence guarantees 
and evolutionary search methods that evaluate 
successive generations of randomly mutated pro- 
grams). Here, we focus on approaches that have 
been particularly successful to date. 

Many algorithms focus on function approxi- 
mation problems, where the task is embodied 
in a fimction (e.g., given an input transaction, out- 
put a “fraud” or “not fraud” label), and the learn- 
ing problem is to improve the accuracy of that 
function, with experience consisting of a sample 
of known input-output pairs of the function. In 
some cases, the function is represented explicit- 
ly as a parameterized functional form; in other 
cases, the function is implicit and obtained via a 
search process, a factorization, an optimization 
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Despite practical challenges, we are hopefiil that 
informed discussions among policy-makers and the 
public about data and the capabilities of machine 
learning, will lead to insightful designs of programs 
and policies that can balance the goals of protecting 
privacy and ensuring fairness with those of reaping 
the benefits to scientific research and to individual 
and public health. Our commitments to privacy and 
fairness are evergreen, but our policy choices must 
adapt to advance them, and support new tech- 
niques for deepening our knowledge. 
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machine-learning methods can be found throughout science, technology and commerce, 
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M achine learning is a discipline focused 
on two interrelated questions: How can 
one construct computer systems that auto- 
matically improve through experience? 
and What are the fundamental statistical- 
computational-information-theoretic laws that 
govern all learning systems, including computers, 
humans, and organizations? The study of machine 
learning is important both for addressing these 
fundamental scientific and engineering ques- 
tions and for the highly practical computer soft- 
ware it has produced and fielded across many 
applications. 

Machine learning has progressed dramati- 
cally over the past two decades, from laboratory 
curiosity to a practical technology in widespread 
commercial use. Within artificial intelligence (AI), 
machine learning has emerged as the method 
of choice for developing practical software for 
computer vision, speech recognition, natural lan- 
guage processing, robot control, and other ap- 
plications. Many developers of AI systems now 
recognize that, for many applications, it can be 
far easier to train a system by showing it exam- 
ples of desired input-output behavior than to 
program it manually by anticipating the desired 
response for aH possible inputs. The effect of ma- 
chine learning has also been felt broadly across 
computer science and across a range of indus- 
tries concerned with data-intensive issues, such 
as consumer services, the diagnosis of faults in 
complex systems, and the control of logistics 
chains. There has been a similarly broad range of 
effects across empirical sciences, from biology to 
cosmology to social science, as machine-learning 
methods have been developed to analyze high- 
throughput experimental data in novel ways. See 
Fig. 1 for a depiction of some recent areas of ap- 
plication of machine learning. 

A learning problem can be defined as the 
problem of improving some measure of perform- 
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ance when executing some task, through some 
type of training experience. For example, in learn- 
ing to detect credit-card fraud, the task is to as- 
sign a label of ‘Traud” or “not fraud” to any given 
credit-card transaction. The performance metric 
to be improved might be the accuracy of this 
fraud classifier, and the training experience might 
consist of a collection of historical credit-card 
transactions, each labeled in retrospect as fraud- 
ulent or not. Alternatively, one might define a 
different performance metric that assigns a higher 
penalty when ‘fraud” is labeled “not fraud” than 
when “not fraud” is incorrectly labeled “fraud.” 
One might also define a different type of training 
experience— for example, by including unlab- 
eled credit-card transactions along with labeled 
examples. 

A diverse array of machine-learning algorithms 
has been developed to cover the wide variety of 
data and problem types exhibited across differ- 
ent machine-learning problems (i, 2). Conceptual- 
ly, machine-learning algorithms can be viewed as 
searching through a large space of candidate 
programs, guided by training experience, to find 
a program that optimizes the performance metric. 
Machine-learning algorithms vaiy greatly, in part 
by the way in which they represent candidate 
programs (e.g., decision trees, mathematical func- 
tions, and general programming languages) and in 
part by the way in which they search through this 
space of programs (e.g., optimization algorithms 
with well-understood convergence guarantees 
and evolutionary search methods that evaluate 
successive generations of randomly mutated pro- 
grams). Here, we focus on approaches that have 
been particularly successful to date. 

Many algorithms focus on function approxi- 
mation problems, where the task is embodied 
in a fimction (e.g., given an input transaction, out- 
put a “fraud” or “not fraud” label), and the learn- 
ing problem is to improve the accuracy of that 
function, with experience consisting of a sample 
of known input-output pairs of the function. In 
some cases, the function is represented explicit- 
ly as a parameterized functional form; in other 
cases, the function is implicit and obtained via a 
search process, a factorization, an optimization 
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S Fig. 1. Applications of machine learning. Machine learning is having a substantial effect on many 

^ areas of technology and science: examples of recent applied success stories include robotics and 

o autonomous vehicle control (top left), speech processing and natural language processing (top 

w right), neuroscience research (middle), and applications in computer vision (bottom). [The middle 

5 panel is adapted from (29). The images in the bottom panel are from the ImageNet database: object 

o recognition annotation is by R. Girshick.] 
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procedure, or a simulation-based procedure. Even 
when implicit, the function generally depends 
on parameters or other tunable degrees of free- 
dom, and training corresponds to finding values 
for these parameters that optimize the perform- 
ance metric. 

Whatever the learning algorithm, a key scien- 
tific and practical goal is to theoretically character- 
ize the capabilities of specific learning algorithms 
and the inherent difficulty of any given learning 
problem: How accurately can the algorithm learn 
from a particular type and volume of training 
data? How robust is the algorithm to errors in its 
modeling assumptions or to errors in the train- 
ing data? Given a learning problem with a given 
volume of training data, is it possible to design a 
successful algorithm or is this learning problem 
fundamentally intractable? Such theoretical char- 
acterizations of machine-learning algorithms and 
problems typically make use of the familiar frame- 
works of statistical decision theory and compu- 
tational complexity theory. In fact, attempts to 
characterize machine-learning algorithms the- 
oretically have led to blends of statistical and 
computational theory in vdiich the goal is to simul- 
taneously characterize the sample complexity 
(how much data are required to learn accurately) 
and the computational complexity (how much 
computation is required) and to specify how these 
depend on features of the learning algorithm such 
as the representation it uses for what it learns 
{3-6). A specific form of computational analysis 
that has proved particularly useful in recent 
years has been that of optimization theory, 
with upper and lower bounds on rates of con- 
vergence of optimization procedures merging 
well with the formulation of machine-learning 
problems as the optimization of a performance 
metric (7, 8). 

As a field of study, machine learning sits at the 
crossroads of computer science, statistics and a 
variety of other disciplines concerned with auto- 
matic improvement over time, and inference and 
decision-making under uncertainty. Related dis- 
ciplines include the psychological study of human 
learning, the study of evolution, adaptive control 
theory, the study of educational practices, neuro- 
science, organizational behavior, and economics. 
Although the past decade has seen increased cross- 
talk with these other fields, we are just beginning 
to tap the potential synergies and the diversity 
of formalisms and experimental methods used 
across these multiple fields for studying systems 
that improve with experience. 

Drivers of machine-learning progress 

The past decade has seen rapid growth in the 
ability of networked and mobile computing sys- 
tems to gather and transport vast amounts of 
data, a phenomenon often referred to as “Big 
Data.” The scientists and engineers who collect 
such data have often turned to machine learn- 
ing for solutions to the problem of obtaining 
useful insights, predictions, and decisions from 
such data sets. Indeed, the sheer size of the data 
makes it essential to develop scalable proce- 
dures that blend computational and statistical 
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considerations, but the issue is more than the 
mere size of modern data sets; it is the granular, 
personalized nature of much of these data. Mo- 
bile devices and embedded computing permit 
large amounts of data to be gathered about in- 
dividual humans, and machine-learning algo- 
rithms can learn from these data to customize 
their services to the needs and circumstances 
of each individual. Moreover, these personalized 
services can be connected, so that an overall ser- 
vice emerges that takes advantage of the wealth 
and diversity of data from many individuals 
while still customizing to the needs and circum- 
stances of each. Instances of this trend toward 
capturing and mining large quantities of data to 
improve services and productivity can be found 
across many fields of commerce, science, and 
government. Historical medical records are used 
to discover which patients will respond best 
to which treatments; historical traffic data are 
used to improve traffic control and reduce con- 
gestion; historical crime data are used to help 
allocate local police to specific locations at spe- 
cific times; and large experimental data sets are 
captured and curated to accelerate progress in 
biology, astronomy, neuroscience, and other data- 
intensive empirical sciences. We appear to be at 
the beginning of a decades-long trend toward in- 
creasingly data-intensive, evidence-based decision- 
making across many aspects of science, commerce, 
and government. 

With the increasing prominence of large-scale 
data in all areas of human endeavor has come a 
wave of new demands on the underlying machine- 
learning algorithms. For example, huge data sets 
require computationally tractable algorithms, high- 
ly personal data raise the need for algorithms 
that minimize privacy effects, and the availabil- 
ity of huge quantities of unlabeled data raises 
the challenge of designing learning algorithms 
to take advantage of it. The next sections survey 
some of the effects of these demands on recent 



work in machine-learning algorithms, theory, and 
practice. 

Core methods and recent progress 

The most widely used machine-learning methods 
are supervised learning methods (1). Supervised 
learning systems, including spam classifiers of 
e-mail, face recognizers over images, and med- 
ical diagnosis systems for patients, all exemplify 
the function approximation problem discussed 
earlier, where the training data take the form of 
a collection of (x, y) pairs and the goal is to 
produce a prediction y* in response to a query 
X*. The inputs x may be classical vectors or they 
maybe more complex objects such as documents, 
images, DNA sequences, or graphs. Similarly, 
many different kinds of output y have been studied 
Much progress has been made by focusing on 
the simple binary classification problem in which 
y takes on one of two values (for example, “spam” 
or “not spam”), but there has also been abun- 
dant research on problems such as multiclass 
classification (where y takes on one of A' labels), 
multilabel classification (where y is labeled simul- 
taneously by several of the K labels), ranking 
problems (where y provides a partial order on 
some set), and general structured prediction 
problems (where ?/ is a combinatorial object such 
as a graph, whose components may be required 
to satisfy some set of constraints). An example 
of the latter problem is part-of-speech tagging, 
where the goal is to simultaneously label every 
word in an input sentence x as being a noun, 
verb, or some other part of speech. Supervised 
learning also includes cases in which y has real- 
valued components or a mixture of discrete and 
real-valued components. 

Supervised learning systems generally form 
their predictions via a learned mapping f{x), 
which produces an output y for each input x (or 
a probability distribution over y given x). Many 
different forms of mapping / exist, including 



decision trees, decision forests, logistic regres- 
sion, support vector machines, neural networks, 
kernel machines, and Bayesian classifiers (i). A 
variety of learning algorithms has been proposed 
to estimate these different types of mappings, and 
there are also generic procedures such as boost- 
ing and multiple kernel learning that combine 
the outputs of multiple learning algorithms. 
Procedures for learning /from data often make 
use of ideas from optimization theory or numer- 
ical analysis, with the specific form of machine- 
learning problems (e.g., that the objective function 
or frinction to be integrated is often the sum over 
a large number of terms) driving innovations. This 
diversity of learning architectures and algorithms 
reflects the diverse needs of applications, with 
different architectures capturing different kinds 
of mathematical structures, offering different lev- 
els of amenability to post-hoc visualization and 
explanation, and providing varying trade-offs 
between computational complexity, the amount 
of data, and performance. 

One high-impact area of progress in supervised 
learning in recent years involves deep networks, 
which are multilayer networks of threshold units, 
each of which computes some simple param- 
eterized flmction of its inputs (P, 10). Deep learning 
systems make use of gradient-based optimiza- 
tion algorithms to adjust parameters throughout 
such a multilayered network based on errors at 
its output. Exploiting modern parallel comput- 
ing architectures, such as graphics processing 
units originally developed for video gaming, it 
has been possible to build deep learning sys- 
tems that contain billions of parameters and 
that can be trained on the very large collections 
of images, videos, and speech samples available 
on the Internet. Such large-scale deep learning 
systems have had a major effect in recent years 
in computer vision {11) and speech recognition 
{12), where they have yielded major improve- 
ments in performance over previous approaches 
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Fig. 2. Automatic generation of text captions for images with deep networks. A convolutional neural network is trained to interpret images, and its 
output is then used by a recurrent neural network trained to generate a text caption (top). The sequence at the bottom shows the word-by-word focus of 
the network on different parts of input image while it generates the caption word-by-word. [Adapted with permission from (30)] 
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Fig. 3. Topic models. Topic modeling is a methodology for analyzing documents, where a document is viewed as a collection of words, and the words in 
the document are viewed as being generated by an underlying set of topics (denoted by the colors in the figure). Topics are probability distributions 
across words (leftmost column), and each document is characterized by a probability distribution across topics (histogram). These distributions are 
inferred based on the analysis of a collection of documents and can be viewed to classify, index, and summarize the content of documents. [From { 31 ). 
Copyright 2012, Association for Computing Machinery, Inc. Reprinted with permission] 



(see Fig. 2). Deep network methods are being 
actively pursued in a variety of additional appli- 
cations from natural language translation to 
collaborative filtering. 

The internal layers of deep networks can be 
viewed as providing learned representations of 
the input data. While much of the practical suc- 
cess in deep learning has come from supervised 
learning methods for discovering such repre- 
sentations, efforts have also been made to devel- 
op deep learning algorithms that discover useful 
representations of the input without the need for 
labeled training data {13). The general problem is 
referred to as unsupervised learning, a second 
paradigm in machine-learning research {2). 

Broadly, unsupervised learning generally in- 
volves the analysis of unlabeled data under as- 
sumptions about structural properties of the 
data (e.g., algebraic, combinatorial, or probabi- 
listic). For example, one can assume that data 
lie on a low-dimensional manifold and aim to 
identify that manifold explicitly from data. Di- 
mension reduction methods— including prin- 
cipal components analysis, manifold learning, 
factor analysis, random projections, and autoen- 
coders (i, 2)— make different specific assump- 
tions regarding the underlying manifold (e.g., 
that it is a linear subspace, a smooth nonlinear 
- manifold, or a collection of submanifolds). An- 
< other example of dimension reduction is the 

S topic modeling framework depicted in Fig. 3. 

^ A criterion function is defined that embodies 
o these assumptions— often making use of general 
w statistical principles such as maximum like- 
5 lihood, the method of moments, or Bayesian 
o integration— and optimization or sampling algo- 



rithms are developed to optimize the criterion. 
As another example, clustering is the problem 
of finding a partition of the observed data (and 
a rule for predicting future data) in the absence 
of explicit labels indicating a desired partition. 
A wide range of clustering procedures has been 
developed, all based on specific assumptions 
regarding the nature of a “cluster.” In both clus- 
tering and dimension reduction, the concern 
with computational complexity is paramount, 
given that the goal is to exploit the particularly 
large data sets that are available if one dis- 
penses v^th supervised labels. 

A third major machine-learning paradigm is 
reinforcement learning {14, i5). Here, the infor- 
mation available in the training data is inter- 
mediate between supervised and unsupervised 
learning. Instead of training examples that in- 
dicate the correct output for a given input, the 
training data in reinforcement learning are as- 
sumed to provide only an indication as to whether 
an action is correct or not; if an action is incor- 
rect, there remains the problem of finding the 
correct action. More generally, in the setting of 
sequences of inputs, it is assumed that reward 
signals refer to the entire sequence; the assign- 
ment of credit or blame to individual actions in the 
sequence is not directly provided. Indeed, although 
simplified versions of reinforcement learning 
knovm as bandit problems are studied, where it 
is assumed that rewards are provided after each 
action, reinforcement learning problems typically 
involve a general control-theoretic setting in 
which the learning task is to learn a control strat- 
egy (a “policy”) for an agent acting in an unknovm 
dynamical environment, where that learned strat- 



egy is trained to chose actions for any given state, 
with the objective of maximizing its expected re- 
ward over time. The ties to research in control 
theory and operations research have increased 
over the years, with formulations such as Markov 
decision processes and partially observed Mar- 
kov decision processes providing points of con- 
tact {15, 16). Reinforcement-learning algorithms 
generally make use of ideas that are familiar 
from the control-theory literature, such as policy 
iteration, value iteration, rollouts, and variance 
reduction, with innovations arising to address 
the specific needs of machine learning (e.g., large- 
scale problems, few assumptions about the un- 
known dynamical environment, and the use of 
supervised learning architectures to represent 
policies). It is also worth noting the strong ties 
between reinforcement learning and many dec- 
ades of work on learning in psychology and 
neuroscience, one notable example being the 
use of reinforcement learning algorithms to pre- 
dict the response of dopaminergic neurons in 
monkeys learning to associate a stimulus light 
with subsequent sugar reward {17). 

Although these three learning paradigms help 
to organize ideas, much current research involves 
blends across these categories. For example, semi- 
supervised learning makes use of unlabeled data 
to augment labeled data in a supervised learning 
context, and discriminative training blends ar- 
chitectures developed for unsupervised learning 
with optimization formulations that make use 
of labels. Model selection is the broad activity of 
using training data not only to fit a model but 
also to select from a family of models, and the 
fact that training data do not directly indicate 
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which model to use leads to the use of algo- 
rithms developed for bandit problems and to 
Bayesian optimization procedures. Active learn- 
ing arises when the learner is allowed to choose 
data points and query the trainer to request tar- 
geted information, such as the label of an other- 
wise unlabeled example. Causal modeling is the 
effort to go beyond simply discovering predictive 
relations among variables, to distinguish which 
variables causally influence others (e.g., a high 
white-blood-cell count can predict the existence 
of an infection, but it is the infection that causes 
the high white-cell count). Many issues influence 
the design of learning algorithms across all of 
these paradigms, including whether data are 
available in batches or arrive sequentially over 
time, how data have been sampled, require- 
ments that learned models be interpretable by 
users, and robustness issues that arise when 
data do not fit prior modeling assumptions. 

Emerging trends 

The field of machine learning is sufficiently young 
that it is still rapidly expanding, often by invent- 
ing new formalizations of machine-learning 
problems driven by practical applications. (An 
example is the development of recommendation 
systems, as described in Fig. 4.) One major trend 
driving this expansion is a growing concern with 
the environment in which a machine-learning 
algorithm operates. The word “environment” 
here refers in part to the computing architecture; 
whereas a classical machine-learning system in- 
volved a single program running on a single ma- 
chine, it is now common for machine-learning 
systems to be deployed in architectures that in- 
clude many thousands or ten of thousands of 
processors, such that communication constraints 
and issues of parallelism and distributed pro- 
cessing take center stage. Indeed, as depicted 
in Fig. 5, machine-learning systems are increas- 
ingly taking the form of complex collections of 
software that run on large-scale parallel and dis- 
tributed computing platforms and provide a range 
of algorithms and services to data analysts. 

The word “environment” also refers to the 
source of the data, which ranges from a set of 
people who may have privacy or ownership con- 
cerns, to the analyst or decision-maker who may 
have certain requirements on a machine-learning 
system (for example, that its output be visual- 
izable), and to the social, legal, or political frame- 
work surrounding the deployment of a system. 
The environment also may include other machine- 
learning systems or other agents, and the overall 
collection of systems may be cooperative or ad- 
versarial. Broadly speaking, environments pro- 
vide various resources to a learning algorithm 
and place constraints on those resources. Increas- 
ingly, machine-learning researchers are formalizing 
these relationships, aiming to design algorithms 
that are provably effective in various environ- 
ments and explicitly allow users to express and 
control trade-offs among resources. 

As an example of resource constraints, let us 
suppose that the data are provided by a set of 
individuals who wish to retain a degree of pri- 



vacy. Privacy can be formalized via the notion of 
“differential privacy,” which defines a probabi- 
listic channel between the data and the outside 
world such that an observer of the output of the 
channel cannot infer reliably whether particular 
individuals have supplied data or not (18). Clas- 
sical applications of differential privacy have 
involved insuring that queries (e.g., “what is the 
maximum balance across a set of accounts?”) to 
a privatized database return an answer that is 
close to that returned on the nonprivate data. 
Recent research has brought differential privacy 
into contact with machine learning, where que- 
ries involve predictions or other inferential asser- 
tions (e.g., “given the data I've seen so far, what is 
the probability that a new transaction is fraud- 
ulent?”) (19, 20). Placing the overall design of a 
privacy-enhancing machine-learning system 
within a decision-theoretic framework provides 
users with a tuning knob whereby they can choose 
a desired level of privacy that takes into account 
the kinds of questions that will be asked of the 
data and their own personal utility for the an- 
swers. For example, a person may be willing to 




Fig. 4. Recommendation ^sterns. A recommen- 
dation system is a machine-learning system that is 
based on data that indicate links between a set 
of a users (e.g., people) and a set of items (e.g., 
products). A link between a user and a product 
means that the user has indicated an interest in 
the product in some fashion (perhaps by purchas- 
ing that item in the past). The machine-learning prob- 
lem is to suggest other items to a given user that he 
or she may also be interested in, based on the data 
across all users. 



reveal most of their genome in the context of 
research on a disease that runs in their family 
but may ask for more stringent protection if in- 
formation about their genome is being used to 
set insurance rates. 

Communication is another resource that needs 
to be managed within the overall context of a 
distributed learning system. For example, data 
may be distributed across distinct physical loca- 
tions because their size does not allow them to 
be aggregated at a single site or because of ad- 
ministrative boundaries. In such a setting, we may 
wish to impose a bit-rate communication con- 
straint on the machine-learning algorithm. Solving 
the design problem under such a constraint will 
generally show how the performance of the learn- 
ing system degrades under decrease in commu- 
nication bandwidth, but it can also reveal how 
the performance improves as the number of dis- 
tributed sites (e.g., machines or processors) in- 
creases, trading off these quantities against the 
amount of data (21, 22). Much as in classical in- 
formation theory, this line of research aims at 
fundamental lower bounds on achievable per- 
formance and specific algorithms that achieve 
those lower bounds. 

A major goal of this general line of research is 
to bring the kinds of statistical resources studied 
in machine learning (e.g., number of data points, 
dimension of a parameter, and complexity of a 
hypothesis class) into contact with the classical 
computational resources of time and space. Such 
a bridge is present in the “probably approximately 
correct” (PAC) learning framework, which studies 
the effect of adding a polynomial-time compu- 
tation constraint on this relationship among error 
rates, training data size, and other parameters of 
the learning algorithm (3). Recent advances in 
this line of research include various lower bounds 
that establish fundamental gaps in performance 
achievable in certain machine-learning prob- 
lems (e.g., sparse regression and sparse princi- 
pal components analysis) via polynomial-time 
and exponential-time algorithms (23). The core 
of the problem, however, involves time-data trade- 
offs that are far from the polynomial/exponential 
boundary. The large data sets that are increas- 
ingly the norm require algorithms whose time 
and space requirements are linear or sublinear in 
the problem size (number of data points or num- 
ber of dimensions). Recent research focuses on 
methods such as subsampling, random projec- 
tions, and algorithm weakening to achieve seal- 
ability while retaining statistical control (24, 25). 

The ultimate goal is to be able to supply time 
and space budgets to machine-learning systems 
in addition to accuracy requirements, with the 
system finding an operating point that allows 
such requirements to be realized. 

Opportunities and challenges 

Despite its practical and commercial successes, ^ 
machine learning remains a young field with g 
many underexplored research opportunities. g 
Some of these opportunities can be seen by con- ^ 
trasting current machine-learning approaches 5 
to the types of learning we observe in naturally o 
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Fig. 5. Data analytics stack. Scalable machine-learning systems are layered architectures that are 
built on parallel and distributed computing platforms. The architecture depicted here— an open- 
source data analysis stack developed in the Algorithms, Machines and People (AMP) Laboratory at 
the University of California, Berkeley— includes layers that interface to underlying operating systems: 
layers that provide distributed storage, data management, and processing: and layers that provide 
core machine-learning competencies such as streaming, subsampling, pipelines, graph processing, 
and model serving. 
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occurring systems such as humans and other 
animals, organizations, economies, and biological 
evolution. For example, whereas most machine- 
learning algorithms are targeted to learn one 
specific function or data model from one single 
data source, humans clearly learn many differ- 
ent skills and types of knowledge, from years 
of diverse training experience, supervised and 
unsupervised, in a simple-to-more-difficult se- 
quence (e.g., learning to crawl, then walk, then 
run). This has led some researchers to begin 
exploring the question of how to construct com- 
puter lifelong or never-ending learners that op- 
erate nonstop for years, learning thousands of 
interrelated skills or functions within an over- 
all architecture that allows the system to im- 
prove its ability to learn one skill based on 
having learned another (26-28). Another aspect 
of the analogy to natural learning systems sug- 
gests the idea of team-based, mixed-initiative 
learning. For example, whereas current machine- 
learning systems typically operate in isolation 
to analyze the given data, people often work 
in teams to collect and analyze data (e.g., biol- 
ogists have worked as teams to collect and an- 
alyze genomic data, bringing together diverse 
experiments and perspectives to make progress 
on this difficult problem). New machine-learning 
methods capable of working collaboratively with 
humans to jointly analyze complex data sets 
might bring together the abilities of machines 
to tease out subtle statistical regularities from 
massive data sets with the abilities of humans to 
draw on diverse background knowledge to gen- 
erate plausible explanations and suggest new 
hypotheses. Many theoretical results in machine 
learning apply to all learning systems, whether 
they are computer algorithms, animals, organ- 
izations, or natural evolution. As the field pro- 
gresses, we may see machine-learning theory 
and algorithms increasingly providing models 
for understanding learning in neural systems. 



organizations, and biological evolution and see 
machine learning benefit from ongoing studies 
of these other types of learning systems. 

As with any powerful technology, machine 
learning raises questions about which of its po- 
tential uses society should encourage and dis- 
courage. The push in recent years to collect new 
kinds of personal data, motivated by its eco- 
nomic value, leads to obvious privacy issues, as 
mentioned above. The increasing value of data 
also raises a second ethical issue: Who will have 
access to, and ownership of, online data, and who 
will reap its benefits? Currently, much data are 
collected by corporations for specific uses leading 
to improved profits, with little or no motive for 
data sharing. However, the potential benefits that 
society could realize, even from existing online 
data, would be considerable if those data were to 
be made available for public good. 

To illustrate, consider one simple example 
of how society could benefit from data that is 
already online today by using this data to de- 
crease the risk of global pandemic spread from 
infectious diseases. By combining location data 
from online sources (e.g., location data from cell 
phones, from credit-card transactions at retail 
outlets, and from security cameras in public places 
and private buildings) with online medical data 
(e.g., emergency room admissions), it would be 
feasible today to implement a simple system to 
telephone individuals immediately if a person 
they were in close contact with yesterday was just 
admitted to the emergency room with an infec- 
tious disease, alerting them to the symptoms they 
should watch for and precautions they should 
take. Here, there is clearly a tension and trade-off 
between personal privacy and public health, and 
society at large needs to make the decision on 
how to make this trade-off. The larger point of 
this example, however, is that, although the data 
are already online, we do not currently have the 
laws, customs, culture, or mechanisms to enable 



society to benefit from them, if it wishes to do so. 

In fact, much of these data are privately held and 
owned, even though they are data about each of 
us. Considerations such as these suggest that 
machine learning is likely to be one of the most 
transformative technologies of the 21st century. 
Although it is impossible to predict the future, it 
appears essential that society begin now to con- 
sider how to maximize its benefits. 
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REVIEW 

Advances in natural 
l an g ua ge processing 

Julia Hirschberg^* * and Christopher D. Manning^’^ 

Natural language processing employs computational techniques for the purpose of learning, 
understanding, and producing human language content. Early computational approaches to 
language research focused on automating the analysis of the linguistic structure of language 
and developing basic technologies such as machine translation, speech recognition, and speech 
synthesis. Today’s researchers refine and make use of such tools in real-world applications, 
creating spoken dialogue systems and speech-to-speech translation engines, mining social 
media for information about health or finance, and identifying sentiment and emotion toward 
products and services. We describe successes and challenges in this rapidly advancing area. 



O ver the past 20 years, computational lin- 
guistics has grown into both an exciting 
area of scientific research and a practical 
technology that is increasingly being in- 
corporated into consumer products (for 
example, in applications such as Apple’s Siri and 
Skype Translator). Four key factors enabled these 
developments: (i) a vast increase in computing 
power, (ii) the availability of very large amounts 
of linguistic data, (iii) the development of highly 
successful machine learning (ML) methods, and 
(iv) a much richer understanding of the structure 
of human language and its deployment in social 
contexts. In this Review, we describe some cur- 
rent application areas of interest in language 
research. These efforts illustrate computational 
approaches to big data, based on current cutting- 
edge methodologies that combine statistical anal- 
ysis and ML with knowledge of language. 

Computational linguistics, also known as nat- 
ural language processing (NLP), is the subfield 
of computer science concerned with using com- 
putational techniques to learn, understand, and 
produce human language content. Computation- 
al linguistic systems can have multiple purposes: 
The goal can be aiding human-human commu- 
nication, such as in machine translation (MT); 
aiding human-machine communication, such as 
with conversational agents; or benefiting both 
humans and machines by analyzing and learn- 
ing from the enormous quantity of human lan- 
guage content that is now available online. 

During the first several decades of work in 
computational linguistics, scientists attempted 
to write down for computers the vocabularies 
and rules of human languages. This proved a 
difficult task, owing to the variability, ambiguity, 
and context-dependent interpretation of human 
languages. For instance, a star can be either an 
astronomical object or a person, and “star” can 
be a noun or a verb. In another example, two in- 
terpretations are possible for the headline “Teacher 
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strikes idle kids,” depending on the noun, verb, and 
adjective assignments of the words in the sentence, 
as well as grammatical structure. Beginning in the 
1980s, but more widely in the 1990s, NLP was 
transformed by researchers starting to build mod- 
els over large quantities of empirical language 
data. Statistical or corpus (“body of words”)-based 
NLP was one of the first notable successes of 
the use of big data, long before the power of 
ML was more generally recognized or the term 
“big data” even introduced. 

A central finding of this statistical approach to 
NLP has been that simple methods using words, 
part-of-speech (POS) sequences (such as whether 
a word is a noun, verb, or preposition), or simple 
templates can often achieve notable results when 
trained on large quantities of data. Many text 
and sentiment classifiers are still based solely on 
the different sets of words (“bag of words”) that 
documents contain, without regard to sentence 
and discourse structure or meaning. Achieving 
improvements over these simple baselines can be 
quite difficult. Nevertheless, the best-performing 
systems now use sophisticated ML approaches 
and a rich understanding of linguistic structure. 
High-performance tools that identify syntactic 
and semantic information as well as information 
about discourse context are now available. One 
example is Stanford CoreNLP (i), which provides 
a standard NLP preprocessing pipeline that in- 
cludes POS tagging (with tags such as noun, verb, 
and preposition); identification of named entities, 
such as people, places, and organizations; parsing 
of sentences into their grammatical structures; 
and identifying co-references between noun 
phrase mentions (Fig. 1). 

Historically, two developments enabled the 
initial transformation of NLP into a big data field. 
The first was the early availability to researchers 
of linguistic data in digital form, particularly 
through the Linguistic Data Consortium (LDC) 
(2), established in 1992. Today, large amounts 
of digital text can easily be downloaded from 
the Web. Available as linguistically annotated 
data are large speech and text corpora anno- 
tated with POS tags, syntactic parses, semantic 
labels, annotations of named entities (persons, 
places, organizations), dialogue acts (statement. 



question, request), emotions and positive or neg- 
ative sentiment, and discourse structure (topic 
or rhetorical structure). Second, performance im- 
provements in NLP were spurred on by shared 
task competitions. Originally, these competitions 
were largely funded and organized by the U.S. 
Department of Defense, but they were later or- 
ganized by the research community itself, such 
as the CoNLL Shared Tasks (3). These tasks were 
a precursor of modem ML predictive modeling 
and analytics competitions, such as on Kaggle (4), 
in which companies and researchers post their 
data and statisticians and data miners from all over 
the world compete to produce the best models. 

A major limitation of NLP today is the fact that 
most NLP resources and systems are available 
only for high-resource languages (HRLs), such as 
English, French, Spanish, German, and Chinese. 
In contrast, many low-resource languages (LRLs)— 
such as Bengali, Indonesian, Punjabi, Cebuano, 
and Swahili— spoken and written by millions of 
people have no such resources or systems avail- 
able. A future challenge for the language commu- 
nity is how to develop resources and tools for 
hundreds or thousands of languages, not just a few. 

Machine translation 

Proficiency in languages was traditionally a hall- 
mark of a learned person. Although the social 
standing of this human skill has declined in the 
modern age of science and machines, translation 
between human languages remains cmcially im- 
portant, and MT is perhaps the most substantial 
way in which computers could aid human-human 
communication. Moreover, the ability of com- 
puters to translate between human languages 
remains a consummate test of machine intel- 
ligence: Correct translation requires not only 
the ability to analyze and generate sentences in 
human languages but also a humanlike under- 
standing of world knowledge and context, de- 
spite the ambiguities of languages. For example, 
the French word “bordel” straightforwardly means 
“brothel”; but if someone says “My room is un 
bordel,” then a translating machine has to know 
enough to suspect that this person is probably not 
running a brothel in his or her room but rather is 
saying “My room is a complete mess.” 

Machine translation was one of the first non- 
numeric applications of computers and was studied 
intensively starting in the late 1950s. However, the 
hand-built grammar-based systems of early dec- 
ades achieved very limited success. The field was 
transformed in the early 1990s when researchers 
at IBM acquired a large quantity of English and 
French sentences that were translations of each 
other (known as parallel text), produced as the 
proceedings of the bilingual Canadian Parliament. 
These data allowed them to collect statistics of 
word translations and word sequences and to 
build a probabilistic model of MT (5). 

Following a quiet period in the late 1990s, 
the new millennium brought the potent combina- 
tion of ample online text, including considerable 
quantities of parallel text, much more abundant 
and inexpensive computing, and a new idea 
for building statistical phrase-based MT systems 
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{6). Rather than translating word by word, the 
key advance is to notice that small word groups 
often have distinctive translations. The Japa- 
nese “mizu iro” is literally the sequence 

of two words (“water color”), but this is not the 
correct meaning (nor does it mean a type of 
painting); rather, it indicates a light, sl^-blue color. 
Such phrase-based MT was used by Franz Och in 
the development of Google Translate. 

This technology enabled the services we have 
today, which allow free and instant translation 
between many language pairs, but it still pro- 
duces translations that are only just serviceable 
for determining the gist of a passage. However, 
very promising work continues to push MT for- 
ward. Much subsequent research has aimed to 
better exploit the structure of human language 
sentences (i.e., their syntax) in translation sys- 
tems (7, 8), and researchers are actively building 
deeper meaning representations of language {9) 
to enable a new level of semantic MT. 

Finally, just in the past year, we have seen the 
development of an extremely promising approach 
to MT through the use of deep-learning-based 
sequence models. The central idea of deep learn- 
ing is that if we can train a model with several 
representational levels to optimize a final objec- 
tive, such as translation quality, then the model 
can itself learn intermediate representations 
that are useful for the task at hand. This idea 
has been explored particularly for neural net- 
work models in which information is stored in 
real-valued vectors, with the mapping between 
vectors consisting of a matrix multiplication fol- 
lowed by a nonlinearity, such as a sigmoid func- 
tion that maps the output values of the matrix 
multiplication onto [-1, 1]. Building large models 



of this form is much more practical with the 
massive parallel computation that is now econo- 
mically available via graphics processing units. For 
translation, research has focused on a particular 
version of recurrent neural networks, with enhanced 
‘long short-term memory” computational units 
that can better maintain contextual information 
from early until late in a sentence {10, 11) (Fig. 2). 
The distributed representations of neural networks 
are often very effective for capturing subtle seman- 
tic similarities, and neural MT systems have al- 
ready produced some state-of-the-art results {12, 13). 

A still-underexplored area in MT is getting ma- 
chines to have more of a sense of discourse, so 
that a sequence of sentences translates naturally— 
although work in the area has begun {14). Finally, 
MT is not necessarily a task for machines to do 
alone. Rather it can be reconceptualized as an op- 
portunity for computer-supported cooperative 
work that also exploits human skills {15). In such 
a system, machine intelligence is aimed at human- 
computer interface capabilities of giving effective 
suggestions and reacting productively to human 
input, rather than wholly replacing the skills and 
knowledge of a human translator. 

Spoken dialogue systems and 
conversational agents 

Dialogue has been a popular topic in NLP re- 
search since the 1980s. However, early work on 
text-based dialogue has now expanded to include 
spoken dialogue on mobile devices (e.g., Apple’s 
Siri, Amtrak’s Julie, Google Now, and Microsoft’s 
Cortana) for information access and task-based 
apps. Spoken dialogue systems (SDSs) also allow 
robots to help people with simple manual tasks 
[e.g., Manuela Veloso’s CoBots (id)] or provide 



therapy for less-abled persons [e.g., Maja Mataric’s 
socially assistive robots (i7)]. They also enable ava- 
tars to tutor people in interview or negotiation stra- 
tegies or to help with health care decisions {18, 19). 

The creation of SDSs, whether between hu- 
mans or between humans and artificial agents, 
requires tools for automatic speech recognition 
(ASR), to identify what a human says; dialogue 
management (DM), to determine what that hu- 
man wants; actions to obtain the information or 
perform the activity requested; and text-to-speech 
(TTS) synthesis, to convey that information back 
to the human in spoken form. (Fig. 3). In addition, 
SDSs need to be ready to interact with users when 
an error in speech recognition occurs; to decide 
what words might be incorrectly recognized; and 
to determine what the user actually said, either 
automatically or via dialogue with the user. In 
speech-to-speech translation systems, MT com- 
ponents are also needed to facilitate dialogue 
between speakers of different languages and the 
system, to identify potential mistranslations before 
they occur, and to clarify these with the speaker. 

Practical SDSs have been enabled by break- 
throughs in speech recognition accuracy, mainly 
coming from replacing traditional acoustic feature- 
modeling pipelines with deep-leaming models that 
map sound signals to sequences of human language 
sounds and words {20). Although SDSs now work 
fairly well in limited domains, where the topics of 
the interaction are knovm in advance and where 
the words people are likely to use can be predeter- 
mined, they are not yet very successful in open- 
domain interaction, where users may talk about 
anything at aH. Chatbots following in the tradition 
of ELIZA {21) handle open-domain interaction by 
cleverly repeating variations of the human input; 
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Named entity recognition: 



Co-reference: 



INNPI INNPI [RB] IVBDl Qn] (NNP] |NNP| □ fCCl|PRP||VBZ| |VBG| |PRP| |W| IPRPI ITI 
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Mrs. Clinton previously worked for Mr. Obama, but she is now distancing herself from him. 
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Mrs. Clinton previously worked for Mr. Obama, but she is now distancing herself from him. 




Fig. 1. Many language technology tools start by doing linguistic structure analysis. Here we show output from Stanford CoreNLP. As shown from top to 
bottom, this tool determines the parts of speech of each word, tags various words or phrases as semantic named entities of various sorts, determines which entity 
mentions co-refer to the same person or organization, and then works out the syntactic structure of each sentence, using a dependency grammar analysis. 
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The protests escalated over the weekend <EOS> 




Die proteste waren am wochenende eskaliert <EOS> The protests escalated over the weekend 



Fig. 2. A deep, recurrent neural MT ^stem (10). Initially trained on parallel sentences that translate each 
other, the model learns a representation of each word as a real-valued vector and internal parameter matrices so 
as to optimize translation quality. The trained network can then translate new sentences. Each arrow represents 
a computation unit of a matrix multiplication followed by a nonlinear transformation: the small vectors shown in 
the illustration might really be 1000-dimensional. The recurrent network first encodes the meaning of the source 
sentence (left side, blue). It maintains an internal state representing a partial sentence, which is updated after 
each new word is read (horizontal arrows). Having the upper network layers [mapped to by additional (upper 
vertical) computation arrows] makes this a deep recurrent network. Adding depth improves the ability of the 
model to learn, generalize, and remember. Once the end of the sentence (denoted by <EOS>) is reached 
(middle, dark blue), the network additionally starts to produce a word of translated output at each step from its 
internal state (using a multiclass logistic regression-style model). During translation generation (right side, 
green), the last generated word is fed in as the input at each step. From the stored hidden state and this input, 
the model calculates the next word of the translation. This process repeats until <EOS> is generated. 



this approach is also being attempted in spoken- 
chat systems designed to provide a sense of com- 
panionship for target audiences such as the elderly 
or individuals with dementia (22). In spoken dia- 
logue, information about the speaker’s mental 
state inferred from multimodal information can 
be used to supplement the system’s knowledge of 
what the user is saying. 

There are many challenges in building SDSs, 
in addition to the primary challenge of improv- 
ing the accuracy of the basic ASR, DM, and TTS 
building blocks and extending their use into less- 
restricted domains. These include basic problems 
of recognizing and producing normal human 
conversational behaviors, such as turn-taking 
and coordination. Humans interpret subtle cues 
in speakers’ voices and facial and body gestures 
(where available) to determine when the speaker 
is ready to give up the turn versus simply pausing. 
These cues, such as a filled pause (e.g., “um” or 
“uh”), are also used to establish when some 
feedback from the listener is desirable, to indi- 
cate that he or she is listening or working on a 
request, as well as to provide “grounding” (i.e., 
information about the current state of the con- 
versation). Non-humanlike latency often makes 
SDS burdensome, as users must wait seconds to 
receive a system response. To address this, re- 
searchers are exploring incremental processing 
of ASR, MT, and TTS modules, so that systems 
can respond more quickly to users by beginning 
these recognition, translation, and generation 
processes while the user is still speaking. Hu- 



mans can also disambiguate words such as “yeah” 
and “okay,” which may have diverse meanings— 
including agreement, topic shift, and even 
disagreement— when spoken in different ways. 
In successful and cooperative conversations, hu- 
mans also tend to entrain to their conversational 
partners, becoming more similar to each other in 
pronunciation, word choice, acoustic and pro- 
sodic features, facial expressions, and gestures. 
This tendency has long been used to subtly in- 
duce SDS users to employ terms that the system 
can more easily recognize. Currently, research- 
ers are beginning to believe that systems (parti- 
cularly embodied agents) should entrain to their 
users in these different modalities, and some ex- 
perimental results have shovm that users prefer 
such systems (23) and even think they are more 
intelligent (24). Open issues for DM have long been 
the determination of how to architect the appro- 
priate dialogue flow for particular applications, 
where existing experimental data may be sparse 
and some aspects of the dialogue state may not yet 
have been observed or even be observable from the 
data. Currently, the most widely used approach is 
the POMDP (partially observable Markov decision 
process), which attempts to identify an optimal 
system policy by maintaining a probability distri- 
bution over possible SDS states and updating this 
distribution as the system observes additional 
dialogue behavior (25). This approach may make 
use of the identification of dialogue acts, such as 
whether the user input represents a question, 
statement, or indication of agreement, for example. 



Machine reading 

The printed word has great power to enlighten. 
Machine reading is the idea that machines could 
become intelligent, and could usefully integrate 
and summarize information for humans, by read- 
ing and understanding the vast quantities of text 
that are available. 

In the early decades of artificial intelligence, many 
researchers focused on the approach of trying to 
enable intelligent machines by manually building 
large structured knowledge bases in a formal logi- 
cal language and developing automated reasoning 
methods for deriving fiirther facts from this knowl- 
edge. However, with the emergence of the mod- 
em online world, what we mainly have instead is 
huge repositories of online information coded in 
human languages. One place where this is tme is 
in the scientific literature, where findings are still 
reported almost entirely in human language text 
(with accompanying tables and diagrams). How- 
ever, it is equally tme for more general knowledge, 
where we now have huge repositories of infor- 
mation such as Wikipedia (26). The quantity of 
scientific literature is growing rapidly: For example, 
the size of the U.S. National libraiy of Medicine’s 
Medline index has grovm exponentially (27). At 
such a scale, scientists are unable to keep up with 
the literature, even in their narrow domains of 
expertise. Thus, there is an increased need for 
machine reading for the purposes of comprehend- 
ing and summarizing the literature, as well as ex- 
tracting facts and hypotheses from this material. 

An initial goal is to extract basic facts, most 
commonly a relation between two entities, such 
as “child of’ (for instance. Bill Clinton, Chelsea 
Clinton). This is referred to as relation extrac- 
tion. For particular domain-specific relations, 
many such systems have been successfully built. 
One technique is to use handwritten patterns 
that match the linguistic expression of relations 
(e.g., <PERSON>’s daughter, <PERSON>). Bet- 
ter results can be obtained through the use of 
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Fig. 3. A spoken dialogue system.The three main 
components are represented by rectangles: arrows 
denote the flow of information. 
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ML. A structured prediction classifier proposes 
instances of such relations based on extracted 
features from the sequence of words and gram- 
matical structure of a sentence (28, 29). Such sys- 
tems are the mainstay of literature fact-extraction 
tools in fields such as biomedicine (30, 31). 

In many scientific fields, there have been ma- 
jor efforts to build databases of structured infor- 
mation based on the textual scientific record, 
such as the Gene Ontology database (32) in bio- 
medicine or the PaleoBiology Database for fossil 
records (33). This has generally been done man- 
ually, via concerted work by trained profes- 
sionals. Using artificial intelligence software to 
extract these databases, as well as to perform 
subsequent reasoning and hypothesis genera- 
tion, has become a major research goal. One 
subfield where these questions have been ac- 
tively pursued is pharmacogenomics (34). For 
example, Percha et at. (3S) trained a model of 
drug-drug interactions based on drug-gene in- 
teractions extracted from the literature and were 
able to use it to predict novel drug-drug interactions. 

If a partial knowledge base— for instance, Freebase 
(36), dbpedia (37), Wikidata (38) (related to Wikipe- 
dia), or the Gene Ontology database (32)— has already 
been extracted from biomedical research articles, 
then there is an opportunity to automatically align 
known facts from the knowledge base with puta- 
tive expressions of those facts in text. The type labels 
from this mapping can then be used as if they were 
supervised data for ML information-extraction sys- 
tems (Fig. 4). This is referred to as distantly super- 
vised relation extraction. Early systems aligned 
entity mentions and then made the naive assump- 
tion that sentences containing a pair of entities 
expressed every known relation between the two 
entities in the database (39). More recent systems 
have used increasingly sophisticated probabilistic 
inference to discern which textual clauses map to 
vdiich facts in the knowledge base, or to something 
else entirely (40, 41). A dramatic recent applica- 
tion of this approach has been the DeepDive sys- 
tem (42), which aims to automate the construction 
of such systems by providing efficient large-scale 
learning and inference so a user can simply focus 
on good features for their domain. PaleoDeepDive, 
its application to the fossil record, has recently been 
shown to do a better job at fact extraction from 
journal articles than the scientist volunteers who 
maintain the PaleoBiology Database (43). 

The relation-extraction task is made general, 
if less semantically precise, by aiming to ex- 
tract all relations from any piece of text, a task 
normally referred to as open information ex- 
traction (Open IE) (44). Early work emphasized 
the development of simple but highly scalable 
fact-extraction techniques that do not require 
any kind of hand-labeled data (4S). With ever- 
grovdng computational power, a second genera- 
tion of work increasingly emphasized careful use 
of linguistic structure, which can reliably be ex- 
tracted with the use of detailed NLP (46). 

Currently, a number of avenues are being 
explored to further extend the ability of compu- 
ters to build and use knowledge bases starting 
from textual information. An exciting unification 



is the proposal for universal schemas (47), which 
allow simultaneous inference and knowledge- 
base completion over both the open set of textual 
relations (such as ‘T)om in”) found in Open IE 
and the more exact schema of databases (such 
as per:city_of_birth). Even with all of our text- 
extraction techniques, any knowledge base will 
only be partial and incomplete; some recent 
work explores how it can be probabilistically com- 
pleted to deliver a form of common-sense rea- 
soning (48). Finally, we hope to move beyond 
simply extracting relations, events, and facts to 
be able to understand the relations between 
events (such as causation) and complex multistep 
procedures and processes. In (49), Berant et al. 
explore how this can be done for understanding 
the steps in biological processes, shovdng that 
extracting explicit process structures can improve 
the accuracy of question answering. The flip side of 
machine reading is to provide question-answering 
systems, by which humans can get answers from 
constructed knowledge bases. There has recently 
been dramatic progress in building such systems 
by learning semantic parsers (50). 

Mining social media 

The development of social media has revolution- 
ized the amount and types of information avail- 
able today to NLP researchers. Data available 
from sources such as Twitter, Facebook, YouTube, 
blogs, and discussion forums make it possible 
to examine relations between demographic in- 
formation, language use, and social interaction 
(51). Researchers use Web-scraping techniques, 
often via application program interfaces pro- 
vided by websites, to dovmload previously unim- 
aginable amounts and categories of data. Using 
statistical and ML techniques, they learn to iden- 
tify demographic information (such as age and 
gender) from language, track trending topics and 
popular sentiment, identify opinions and beliefs 
about products and politicians, predict disease 
spreading (for instance, with Google Flu Trends: 
www.google.oig/flutrends/) from symptoms men- 
tioned in tweets or food-related illnesses (52), 
recognize deception in fake reviews (53), and 
identify social networks of people who interact 
together online. 

In this era of big data, the availability of social 
media has revolutionized the ways advertisers, 
journalists, businesses, politicians, and medical 
experts acquire their data and the ways in which 
those data can be put to practical use. Product 
reviews can be mined to predict pricing trends 
and assess advertising campaigns. Political forums 
can be searched to predict candidate appeal and 
performance in elections. Social networks can be 
examined to find indicators of power and influ- 
ence among different groups. Medical forums can 
be studied to discover common questions and 
misconceptions about sufferers from particular 
medical conditions so that website information 
can be improved. 

Social media also provide very large and rich 
sources of conversational data in Web forums 
that can provide ‘Tound” data for the study of lan- 
guage phenomena such as code-switching (mixed 
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Learn how to extract 
slots from free text: 

was born on <DATE> 
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born...at<LOCATION> 



per: city_of_birth 



Fig. 4. Distantly supervised learning. In this ap- 
proach, facts in a structured knowledge repre- 
sentation are projected onto pieces of text that 
mention the people, places, dates, etc., that appear 
in knowledge-base entries. This projection is noisy, 
but when done over large quantities of text, it pro- 
vides enough signal to successfully learn good clas- 
sifiers for extracting relations from text. [Photo 
source: National Library of Australia, http://nla.gov. 
au/nla.pic-anl2267621] 
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language in bilingual speech), hedging behavior 
(v^ords and phrases indicating lack of commitment 
to a proposition such as “sort of”), and hate speech 
or bullying behavior. Social media exist in a wide 
variety of languages, including both HRLs and 
LRLs. These data can be invaluable for enriching 
ASR language models and developing TTS synthe- 
sizers without the need to create costly special- 
purpose corpora. In turn, these technologies can 
be useful in producing SDSs in LRL areas. Such 
systems can provide millions of people with the 
ability to obtain information over their cell phones 
(which are ubiquitous, even among populations 
with low literacy rates or whose languages or 
dialects have no standard written form), similar to 
the residents of HRL countries. The development 
of tools for LRLs from found LRL data, by adapt- 
ing HRL tools, is another important way to use 
found text data. A particular application of data 
mining in LRLs is the mining of data collected 
from Twitter or blogs to provide valuable infor- 
mation for disaster relief organizations, identify- 
ing the most serious problems, where they occur, 
and who is experiencing them. 

There are also some drawbacks to social media 
data mining. There is an increasing concern for 
privacy issues, particularly for an individual’s con- 
trol over their ovm data versus researchers’ desire 
to mine it. Sites such as Twitter severely limit a 
researcher’s ability to dovmload data, which im- 
pedes speedy corpus collection. There is also a 
major issue with discovering “ground truth” in 
online postings, because there is no clear way of 
validating an individual’s demographic information; 
the validity of posts concerning events; and most 
reviews of hotels, restaurants, and products. Aggre- 
gating information from multiple sources at similar 
times can address some validity issues, and sites do 
attempt to identify spurious reviews, but this issue 
remains perhaps the most difficult one for those 
working with social media data. 

Analysis and generation of speaker state 

Speaker states (54), also termed “private states” 
(55), include opinions, speculations, beliefs, emo- 
tions, and any other evaluative views that are 
personally held by the speaker or writer of a 
language. Much of the work in NLP has focused 
on sentiment analysis (identification of positive 
or negative orientation of textual language) and 
identification of belief states (committed belief. 



uncommitted belief, or neutrality of a sentence) 
on the basis of lexical and syntactic information. 
Both sentiment and belief constitute attitudes to- 
ward events and propositions, although sentiment 
can also concern attitudes toward objects such as 
people, organizations, and abstract concepts. De- 
tection of sentiment and emotion in text requires 
lexical and sentence-level information. Sentiment 
can be signaled by words conveying positive or 
negative orientation: For example, “sad,” Vorried,” 
“difficult,” and Veak” are all words with negative 
orientation, whereas “comfortable,” “important,” 
“successful,” and “interesting” convey a positive 
sentiment. Online sentiment dictionaries, such as 
Whissel’s Dictionary of Affect (55), and systems 
created from subject-ranked terms, such as Tausczik 
and Pennebaker’s LIWC (Linguistic Inquiry and 
Word Count) (57), can be used to assess positive 
and negative sentiment in a text. More sophis- 
ticated approaches to sentiment analysis also seek 
to identify the holder (source) as well as the object 
of the sentiment: for instance, who is positive about 
what person, country, activity, or concept (55). 

The speech community has also studied pos- 
itive and negative attitudes by focusing more 
generally on the identification of positive and neg- 
ative emotions, primarily using acoustic and pro- 
sodic information. However, more work is currently 
being done to identify particular emotions, such 
as Ekman’s classic six basic emotions (anger, dis- 
gust, fear, happiness, sadness, surprise), which 
maybe reactions to events, propositions, or objects. 
There has also been considerable research using 
features that have proven important in recog- 
nizing classic emotions to identify other speak- 
er states (such as deception), medical conditions 
(such as autism and Parkinson’s disease), speaker 
characteristics (such as age, gender, likeability, 
pathology, and personality), and speaker conditions 
(such as cognitive load, drunkenness, sleepiness, 
interest, and trust). Corpora collected for such 
studies have been used in the Interspeech Para- 
linguistic Challenges, which have been conducted 
since 2009. Emotion generation has proven a 
more difficult challenge for TTS synthesis. Although 
there are some systems (e.g., MARY) that at- 
tempt to generate emotions such as depression, 
aggression, or cheerftilness (58), the best synthe- 
sized emotion still comes from corpora recorded for 
particular emotions by voice talent imitating those 
emotions. 



Sentiment classification is widely used in opin- 
ion identification (positive or negative views of 
people, institutions, or ideas) in many languages 
and genres. Particular applications abound, such 
as identifying positive and negative movie or pro- 
duct reviews (59, 60) and predicting votes from con- 
gressional records (61) or Supreme Court decisions 
from court proceedings. Figure 5 illustrates a fypical 
restaurant review, annotated for positive, negative, 
and neutral sentiment, as well as basic emotions. 

Mining social media for sentiment or classic emo- 
tions has been a particularly popular topic for the 
purposes of assessing the “public mood” from Twit- 
ter, predicting stock market trends, or simply eval- 
uating a community’s mental state (52). Social media 
such as Twitter, blog posts, and forums also provide 
researchers with very laige amounts of data to use in 
assessing the role of sentiment and emotion in 
identifying other linguistic or social phenomena 
[e.g., sarcasm (63), power relationships, and so- 
cial influence (54)], as well as mental health is- 
sues [e.g., depression (55)]. 

Conclusion and outlook 

Many times during the past 50 years, enthusi- 
astic researchers have had high hopes that the 
language-understanding ability of robots in sci- 
ence fiction movies was just around the corner. 
However, in reality, speech and language under- 
standing did not work well enough at that time 
to power mainstream applications. The situation 
has been changing dramatically over the past five 
years. Huge improvements in speech recognition 
have made talking to your phone a commonplace 
activity, especially for young people. Web search 
engines are increasingly successfiil in understanding 
complex queries, and MT can at least yield the gist 
of material in another language, even if it cannot 
yet produce human-quality translations. Com- 
puter systems trade stocks and futures auto- 
matically, based on the sentiment of reports about 
companies. As a result, there is now great com- 
mercial interest in the deployment of human 
language technology, especially because natural 
language represents such a natural interface 
when interacting with mobile phones. In the 
short term, we feel confident that more data 
and computation, in addition to recent advances 
in ML and deep learning, will lead to further 
substantial progress in NLP. However, the truly 
difficult problems of semantics, context, and 



Breakfast on Broadway is a new place focusing on, you guessed 
it, breakfast/brunch. Went there last Sunday around 1. The food 
was not bad but the service was pretty terrible. We had to wait 15 
minutes just to get menus and another 30 to get something to 
eat. And there were only a few tables occupied! If you don’t mind 
the wait though, the price is right. I’ll probably give it another try. 
Maybe they need time to get their act together. 



Breakfast on Broadway is a new place focusing on, you guessed it, 
breakfast/brunch. Went there last Sunday around 1. The food was 
not bad but [Anger: the service was pretty terrible]. [Disgust: We 
had to wait 15 minutes just to get menus and another 30 to get 
something to eat. And there were only a few tables occupied!] If 
you don’t mind the wait though, the price is right. I’ll probably give 
it another try. [Uncertainty: Maybe they need time to get their act 
together.] 



Fig. 5. Manually annotated text analysis on a sample restaurant review. Sentiment analysis is shown on the left (blue, positive sentiments: red, negative: gray, 
neutral). In the emotion analysis on the right, emotions are shown in bold type and delineated by square brackets. Note in particular the importance of going beyond 
simple keyword analysis: for example, “not” has scope over “bad,” which might mislead simple systems. Also, the presence of “hedge” words and phrases, which muddle 
the intended meaning (e.g., “pretty,” which has a positive connotation, modifying the negative word “terrible”), somewhat decreases the negative score of the next clause. 
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knowledge will probably require new discov- 
eries in linguistics and inference. From this per- 
spective, it is worth noting that the development 
of probabilistic approaches to language is not 
simply about solving engineering problems: Prob- 
abilistic models of language have also been reflected 
back into linguistic science, where researchers are 
finding important new applications in describing 
phonology { 66 ), understanding human language 
processing { 67 ), and modeling linguistic seman- 
tics and pragmatics { 68 ). Many areas of linguistics 
are themselves becoming more empirical and 
more quantitative in their approaches. 
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REVIEW 

Economic reasoning and 
artificial intelligence 

David C. Parkes^* and Michael P. Wellman^* 

The field of artificial intelligence (Al) strives to build rational agents capable of perceiving the 
world around them and taking actions to advance specified goals. Put another way, Al 
researchers aim to construct a synthetic homo economicus, the mythical perfectly rational 
agent of neoclassical economics. We review progress toward creating this new species of 
machine, machina economicus, and discuss some challenges in designing Als that can reason 
effectively in economic contexts. Supposing that Al succeeds in this quest, or at least comes 
close enough that it is useful to think about Als in rationalistic terms, we ask how to design the 
rules of interaction in multi-agent systems that come to represent an economy of Als. Theories of 
normative design from economics may prove more relevant for artificial agents than human 
agents, with Als that better respect idealized assumptions of rationality than people, interacting 
through novel rules and incentive systems quite distinct from those tailored for people. 



E conomics models the behavior of people, 
firms, and other decision-makers as a means 
to understand how these decisions shape 
the pattern of activities that produce value 
and ultimately satisfy (or fail to satisfy) hu- 
man needs and desires. In this enterprise, the 
field classically starts from an assumption that 
actors behave rationally— that is, their decisions 
are the best possible given their available actions, 
their preferences, and their beliefs about the out- 
comes of these actions. Economics is drawn to 
rational decision models because they directly con- 
nect choices and values in a mathematically precise 
manner. Critics argue that the field studies a mythi- 
cal species, homo economicus (“economic man”) 
and produces theories with limited applicability 
to how real humans behave. Defenders acknowl- 
edge that rationality is an idealization but counter 
that the abstraction supports powerful analysis, 
which is often quite predictive of people’s behav- 
ior (as individuals or in aggregate). Even if not 
perfectly accurate representations, rational models 
also allow preferences to be estimated from ob- 
served actions and build understanding that can 
usefully inform policy. 

Artificial intelligence (Al) research is likewise 
drawn to rationality concepts, because they pro- 
vide an ideal for the computational artifacts it 
seeks to create. Core to the modern conception of 
Al is the idea of designing agents: entities that 
perceive the world and act in it (i). The quality of 
an Al design is judged by how well the agent’s 
actions advance specified goals, conditioned on 
the perceptions observed. This coherence among 
perceptions, actions, and goals is the essence of 
rationality. If we represent goals in terms of pref- 
erence over outcomes, and conceive perception 
and action within the framework of decision- 
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making under uncertainty, then the Al agent’s 
situation aligns squarely with the standard eco- 
nomic paradigm of rational choice. Thus, the Al 
designer’s task is to build rational agents, or agents 
that best approximate rationality given the limits 
of their computational resources (2-4). In other 
words, Al strives to construct— out of silicon (or 
whatever) and information— a synthetic homo eco- 
nomicus, perhaps more accurately termed mach- 
ina economicus. 

The shared rationality abstraction provides a 
strong foundation for research that spans Al and 
economics. We start this review by describing pro- 
gress on the question of how to operationalize 
rationality and how to construct Al agents that 
are able to reason about other Als. Supposing 
that Al research succeeds in developing an agent 
that can be usefully modeled as rational (perhaps 



more so than human agents), we turn to research 
on the design of systems populated by multiple 
Als. These multi-agent systems will function as 
Al economies, with Als engaged in transactions 
with other Als as well as with firms and people. 
This prospect has spawned interest in expanding 
theories of normative design from economics, op- 
timizing rules of encounter (5) to guide multi-agent 
interactions. Systems populated by Als may exhibit 
new economic phenomena and thus require a new 
science with which to understand the way they 
function and to guide their design. For example, 
although human cognitive constraints limit the 
design of current markets, systems designed for 
Als may admit more complex interfaces, impose 
greater calculation burdens, and demand more 
stamina of attention. 

At the same time, the ways in which the behav- 
ior of Als deviate from the behavior of people can 
present new challenges. We can already glimpse 
the future of economic Als, with simple Al hots 
pricing books for sale on Amazon and scanning 
for restaurant tables on Opentable for resale at a 
profit (6). Such Als may introduce some efficien- 
cies, but their lack of common sense and their 
designer’s failure to anticipate interactions can 
also lead to books priced at $23 million (7). More 
sophisticated Al strategies, presumably more 
carefully vetted, exert a large influence on finan- 
cial markets, with automated trading algorithms 
estimated to be responsible for more than 70% of 
trades on U.S. stock markets (8). Given the conse- 
quences, it is important to understand the effect 
of ubiquitous automated agents on the perform- 
ance of economic systems. As reasoning is shifted 
from people to Als— designed to learn our pref- 
erences, overcome our decision biases, and make 
complex cost-benefit trade-offs— how too should 
the economic institutions that mediate everyday 
transactions change? 
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Fig. 1. A bounded reinforcement learning agent performs better by pursuing a designed reward 
function different from the objective reward: its actual fitness evaluation. Results (left) from a 
gridworld foraging domain (right), for various limits on the agent’s planning horizon (84). Unless the 
agent is perfectly rational (i.e., no horizon limit)— not typically feasible in realistic applications— the 
designer can often achieve better fitness by directing the agent to optimize an alternative measure. 
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Fig. 2. Researchers produced steady exponential progress on solving games of imperfect information from 1995 to the present. Up to 2007 (left), game 
size was generally reported in terms of nodes in the game tree. Based on methods introduced around that time, it became more meaningful (right) to report size in 
terms of the number of information sets (each many nodes), which represent distinct situations as perceived from the perspective of a player. The circled data 
points correspond to the same milestone: combining the two graphs thus demonstrates the continual exponential improvement. Data are from (23, 35, 85-90). 



We focus here on some of the research direc- 
tions we consider most salient for a future syn- 
thesis of economics and AI engendered by the 
emergence of mcichina economicus. Interesting 
as they are, we only briefly mention here the 
many exciting applications of AI to problems in 
economics such as matching {9), market clearing 
{10), and preference modeling for smart grids 
{11). Nor will we showcase the many ways in 
which economic theory is finding application 
today within AI— for example, game-theoretic 
approaches to multi-agent learning {12) and 
voting procedures to combine the opinions of 
AIs {13). 

Building machina economicus 

Constructing a rational AI raises a host of tech- 
nical challenges not previously addressed in the 
long tradition of rationalistic modeling in the 
social sciences. For economics, the agent atti- 
tudes (e.g., beliefs and preferences) underlying 
rationality are conceptual abstractions. Econo- 
mists need not explain how capabilities and 
preferences, for example, are encoded, nor the 
algorithm by which an agent plans what actions 
to take conditional on its perceptions. Compu- 
tation is abstracted away in the standard eco- 
nomic model and is precisely what the AI scientist 
must account for to operationalize rationality in a 
realized agent. 

This does not mean that an AI design needs to 
incorporate data structures corresponding directly 
to rationality constructs, although many AI archi- 
tectures do feature direct representations for prop- 
ositions, goals, and the like. Such representations 
may simplify the analysis of AI systems— for exam- 
ple, we can ask whether an inference algorithm 
operating on logical expressions possesses desir- 
able properties such as soundness: that all con- 
clusions follow from the premises. Similarly, if an 
AI’s beliefs are encoded as probability distribu- 
tions, we can ask whether it updates its beliefs 
from observations in proper accord with Bayesian 
theory. However, care must be taken in under- 



standing an agent’s attitudes solely in terms of 
its internal data structures. Imperfections in 
decision-making may mean that the beliefs held 
and objectives pursued by a computational agent, 
in effect, vary systematically from those directly 
encoded. 

As an example illustrating this distinction, 
machine-learning researchers adapted from ani- 
mal learning the concept of reward shaping {14). 
In reinforcement learning, the agent derives a 
policy (mapping from perception sequences to 
actions) based on rewards representing instan- 
taneous value associated with a state and action. 
A designer specifying the input reward can often 
train the agent more efficiently by shaping the 
reward signal over the learning process to facil- 
itate convergence to behavior optimizing the 
designer’s objective. The framework of optimal 
rewards (75) provides a general treatment distin- 
guishing reward specifications and designer goals. 
As shown in Fig. 1, the optimal reward input to 
the agent does not generally correspond to the 
designer’s ideal reward. This perspective helps 
explain the role of intrinsic motivations (e.g., 
curiosity) in a flexible learning agent. 

Although the mantle of designing machina 
economicus may not be adopted (particularly 
in such explicit terms) by all AI researchers, 
many AI advances over the past few decades 
can be characterized as progress in operation- 
alizing rationality. For instance, probabilistic 
reasoning was largely eschewed by AI 30 years 
ago but now pervades the field, thanks to 
developments in representation and inference 
using Bayesian networks and related graphi- 
cal formalisms. Expressing uncertainty about 
general relationships, beyond mere proposi- 
tions, is routinely supported in probabilistic 
modeling languages {16). Statistical approaches 
now dominate machine learning and natural 
language processing {17, 18). Likewise, prefer- 
ence handling (including methods for eliciting 
preferences from the designer of an AI agent, 
compactly representing preferences over com- 



plex domains, and enabling inference about pre- 
ferences) is regarded as a necessary AI facility. 
Planning, the AI subfield concerned with action 
over time, now conventionally frames its prob- 
lem as one of optimization, subject to resource 
constraints, multiple objectives, and probabilistic 
effects of actions. 

Will AI succeed in developing the ideal ration- 
al agent? As much as we strive to create machina 
economicus, absolutely perfect rationality is un- 
achievable with finite computational resources. 
A more salient question is whether AI agents will 
be sufficiently close to the ideal as to merit think- 
ing about them and interacting with them in 
rationalistic terms. Such is already the case, at 
least in a limited sense. Whenever we anthropo- 
morphize our machines, we are essentially treat- 
ing them as rational beings, responding to them 
in terms of our models of their knowledge, goals, 
and intentions. A more refined version of the 
question is whether our formal rationality the- 
ories will fit well the behavior of AI agents in 
absolute terms or compared to how well the 
theories work for people. Without offering any 
judgment on the question of how well rationality 
theories capture essential human behavior, we 
note the irony in the prospect that social science 
theories may turn out to apply with greater fidel- 
ity to nonhuman agent behavior. 

Reasoning about other agents 

The issue of agent theorizing is not merely aca- 
demic. If we can build one AI agent, then we can 
build many, and these AIs will need to reason 
about each other as well as about people. For AIs 
designed to approximate machina economicus, it 
stands to reason that they should treat each other 
as rational, at least as a baseline assumption. These 
AIs would adopt a game-theoretic view of the 
world, where agents rationally respond to each 
others’ behavior, presumed (recursively) to be ra- 
tional as well. A consequence is that agents would 
expect their joint decisions to be in some form of 
equilibrium, as in standard economic thinking. 
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That AIs (or Al-human combinations) are rea- 
sonably modeled as approximately rational is the 
premise of a growing body of AI research apply- 
ing economic equilibrium models to scenarios 
involving multiple agents {19). The approach has 
achieved notable successes, providing evidence 
for the premise, at least in particular circum- 
stances. Just as single-agent rationality does not 
require literal expected-utility calculations, appli- 
cability of an equilibrium model does not require 
that agents themselves be explicitly engaged in 
equilibrium reasoning. For example, the litera- 
ture on learning in games {20) has identified 
numerous conditions in which simple adaptive 
strategies converge to strategic equilibria. We 
can evaluate the effectiveness of economic mod- 
eling by examining agents built by AI designers 
for specified tasks. For instance, in a study of AI 
trading agents competing in a shopping game 
{21), an agent using standard price equilibrium 
models from economics (specifically, Walrasian 
equilibrium) achieved comparable prediction ac- 
curacy to sophisticated machine-learning ap- 
proaches without using any data, even though 
none of the other agents employed equilibrium 
reasoning. 



A Prisoner’s dilemma 
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poker game {23). HULHE is the largest game of 
imperfect information ever solved (with more 
than 10^^ information sets after removing sym- 
metries) and the first imperfect-information 
game widely played by humans to be solved. The 
solution was the culmination of two decades of 
effort by a series of researchers (see Fig. 2), 
beginning with the exact solution of simplified 
poker games, and proceeding to the approxi- 
mate solution of abstracted versions of the full 
game {24). Computing the approximate Nash 
equilibrium of the full game required massive 
computation and new methods for equilibrium 
search based on regret-matching techniques from 
machine learning. The result is a strategy against 
which even a perfect opponent cannot earn a 
detectable profit. 

In general, the optimal strategy against perfect 
opponents may not be the ideal strategy against 
the more typical fallible kind. Despite consider- 
able effort, however, researchers have not found 
poker algorithms that perform considerably bet- 
ter than game-theoretic solutions, even against 
natural distributions of opponents. It has also 
turned out that game-theoretic approaches have 
been more successful than alternatives, even for 



B Mediated Prisoner’s dilemma 
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Fig. 3. Each entry gives the utility to (row player, column player). (A) Prisoner’s dilemma. The 
dominant strategy equilibrium is (Defect, Defect). (B) Mediated prisoner’s dilemma. The dominant 
strategy equilibrium is (Mediator, Mediator). 



In the rest of this section, we describe fur- 
ther examples in which economic modeling, in 
the form of game-theoretic algorithms, has pro- 
vided an effective way for AIs to reason about 
other agents. The first example is computer poker. 
Although poker is an artificial game, many humans 
have invested a great deal of time and money 
to develop their playing skills. More important, 
poker’s uncertainty and complexity have made 
it a compelling challenge problem for AI tech- 
niques. Early approaches aimed to capture the 
knowledge of expert human players {22), but 
over the past decade, game-theoretic algorithms 
have predominated. Technically, poker is a game 
of imperfect information, where each player 
knows elements of history (cards dealt to them) 
that are secret from others. As uncertainty gets 
partially resolved over time, through card turns 
and betting, players must update their beliefs 
about both card outcomes and the beliefs of 
others. 

A major milestone in computer poker was 
achieved in 2014 with the effective solution of 
‘Tieads up limit hold’em” (HULHE), which is a 
standard two-player version of the most popular 



poker variants that are far from being exactly solved, 
such as no-limit (where bets are unrestricted) 
(25), or games with three or more players {26). 

Much of the interest in game-theoretic rea- 
soning for AI is driven by its applicability to real- 
world problems. The most prominent area of 
application in recent years, and our second exam- 
ple, is that of security games, based on a pioneer- 
ing series of systems developed by Tambe et al. 
{27). In these systems, an agent decides how to 
defend facilities (e.g., airport security through 
placement of checkpoints) by solving a game 
where an attacker is presumed to rationally plan 
in response to the defender’s decision. This ap- 
proach has been successfully deployed in a variety 
of domains, including airport and airline sec- 
urity and coast guard patrols. 

As for any game-theoretic approach, the rec- 
ommendations from these systems are sensitive 
to assumptions made about the other agents 
(here, attackers): their respective preferences, 
beliefs, capabilities, and level of rationality. Rep- 
resentational approaches from AI provide flexi- 
bility, allowing the assumptions made in the strict 
versions typically employed by game theorists to 



be relaxed {28). The field of behavioral game 
theory has developed detailed predictive models 
based on how humans have been observed to 
deviate from game-theoretic rationality {29). Such 
predictive models can be readily incorporated in 
existing game-theoretic reasoning algorithms, as 
has been demonstrated in the context of modeling 
attackers in security games {30). An interesting 
open question is whether the kinds of behavioral 
models that best explain human decision-making 
[see Wright and Leyton-Brown {31) for a meta- 
study] will also prove effective in capturing the 
bounded rationality of computational agents. 

Designing multi-agent systems 

At the multi-agent level, a designer cannot di- 
rectly program behavior of the AIs but instead 
defines the rules and incentives that govern 
interactions among AIs. The idea is to change 
the “rules of the game” (e.g., rewards associated 
with actions and outcomes) to effect change in 
agent behavior and achieve system-wide goals. 
System goals might include, for instance, pro- 
moting an allocation of resources to maximize 
total value, coordinating behavior to complete a 
project on time, or pooling decentralized infor- 
mation to form an accurate prediction about a 
future event. The power to change the interac- 
tion environment is special and distinguishes 
this level of design from the standard AI design 
problem of performing well in the world as given. 

An interesting middle ground is to take the 
world as given but employ reliable entities— 
mediators— that can interact with AIs and per- 
form actions on their behalf {32). Introducing 
mediating entities is relatively straightforward 
in the new AI economy. To see how this can be 
powerful, consider a mediated extension of the 
classic prisoner’s dilemma game (Fig. 3). If both 
AIs grant the mediator the authority to play on 
their behalf (i.e., proxy right), it performs Co- 
operate on behalf of both agents. However, if 
only one AI grants the mediator proxy, it per- 
forms Defect on behalf of that agent. In equi- 
librium, both AIs grant proxy, and the effect is 
to change the outcome from (Defect, Defect) to 
(Cooperate, Cooperate), increasing utility to both 
participants. 

For the more general specification of rules of 
interaction for rational agents, economics has a 
well-developed mathematical theory of mecha- 
nism design {33). The framework of mechanism 
design has been fruitfully applied, for example, to 
the design of matching markets {34) and auc- 
tions {35). Mechanism design is a kind of inverse 
game theory, with the rules inducing a game and 
the quality of the system evaluated in an equi- 
librium. In the standard model, design goals are 
specified in terms of agent preferences on out- 
comes, but these preferences are private and the 
agents are self-interested. A mechanism is a trusted 
entity, able to receive messages from agents that 
make claims (perhaps untruthfiilly) about prefer- 
ences and select an outcome (e.g., an allocation of 
resources or a plan of behavior) on the basis of 
these messages. The challenge is to align incentives 
and promote truthful reports. 
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Varian {36) has argued that the theory of 
mechanism design may actually prove more rel- 
evant for artificial agents than for human agents, 
because AIs may better respect the idealized 
assumptions of rationality made in this frame- 
work. For example, one desirable property of a 
mechanism is incentive compatibility, which stip- 
ulates that truthful reports constitute an equi- 
librium. Sometimes it is even possible to make 
truthful reporting a dominant strategy (optimal 
whatever others do), achieving the strong prop- 
erty of strategy-proofness {37). It seems, however, 
that people do not reliably understand this pro- 
perty; evidence from medical matching markets, 
and also from laboratory experiments, suggests 
that some participants in strategy-proof matching 
mechanisms tiy to misrepresent their preferences 
even though it provides no advantage {38, 39). 

For artificial systems, in comparison, we might 
expect AIs to be truthful where this is optimal 
and to avoid spending computation reasoning 
about the behavior of others where this is not 
useful (5). More generally, mechanism designs 
for AI systems need not be simple because they 
need not be understandable to people. On the 
contrary, AI techniques such as preference rep- 
resentation, preference elicitation, and search 
algorithms can be used to turn the mathematical 
formalisms of mechanism design into concrete 
computational methods {40-42). The design prob- 
lem itself can also be usefully formulated as a 
computational problem, with optimization and 
machine learning used to find solutions to de- 
sign problems for which analytical solutions are 
unavailable {43-46). 

The prospect of an economy of AIs has also 
inspired expansions to new mechanism design 
settings. Researchers have developed incentive- 
compatible multiperiod mechanisms, consider- 
ing such factors as uncertainty about the future 
and changes to agent preferences because of 
changes in local context {47-49). Another direc- 
tion considers new kinds of private inputs be- 
yond preference information {50, 51). For example, 
in a team formation setting, each AI might mis- 
report information about the capabilities of other 
AIs in order to get itself selected for the team {52). 
Similarly, AIs seeking to maximize task assign- 
ments might provide false reports of experience 
in task performance in order to mislead a learn- 
ing mechanism constructing an automatic task 
classifier {53). Systems of AIs can also create new 
challenges for mechanism design. One such chal- 
lenge is false-name bidding, where an AI exploits 
its ability to manage multiple identities. For ex- 
ample, it may gain resources more cheaply by 
dividing a request into a set of smaller requests, 
each placed from a different identity under its 
control. In response, researchers have developed 
mechanisms that are robust to this new kind of 
attack {54). 

The important role of mechanism design in an 
economy of AIs can be observed in practice. Search 
engines run auctions to allocate ads to positions 
alongside search queries. Advertisers bid for their 
ads to appear in response to specific queries (e.g., 
“personal injury lawyer”). Ads are ranked according 



to bid amount (as well as other factors, such as 
ad quality), with higher-ranked ads receiving a 
higher position on the search results page. Early 
auction mechanisms employed first-price rules, 
charging an advertiser its bid amount when its 
ad receives a click. Recognizing this, advertisers 
employed AIs to monitor queries of interest, 
ordered to bid as little as possible to hold onto 
the current position. This practice led to cascades 
of responses in the form of bidding wars, amounting 
to a waste of computation and market ineffi- 
ciency (55). To combat this, search engines in- 
troduced second-price auction mechanisms {37), 
which charge advertisers based on the next-highest 
bid price rather than their own price. This ap- 
proach (a standard idea of mechanism design) 
removed the need to continually monitor the bid- 
ding to get the best price for position, thereby end- 
ing bidding wars (55). 

In recent years, search engine auctions have 
supported richer, goal-based bidding languages. 



For example, an advertiser can ask to maximize 
clicks over a weighted set of queries subject to a 
budget constraint (57, 58). Search engines pro- 
vide proxy agents that then bid on behalf of 
advertisers to achieve the stated goal (59). This 
introduction of proxy agents and the earlier 
switch from first price to second price can be 
interpreted as a computational application of a 
fundamental concept in mechanism design— the 
revelation principle {60-62). Briefly, this states 
that if the rules of a mechanism and the equil- 
ibrium strategies in that mechanism are replaced 
by a new mechanism that is functionally equiv- 
alent to the composition of these rules and strat- 
egies, then the new mechanism will be incentive 
compatible. Although neither redesign provides 
incentive compatibility in a formal sense, both 
second-pricing and proxy bidding can be inter- 
preted as accomplishing on behalf of advertisers 
what they were doing (through AIs) in an earlier 
design (see Fig. 4). Still other ad platform de- 
signs are using a strategy-proof mechanism [the 
Vickrey-Clarke-Groves mechanism {37, 63, 54)] 
to make decisions about the space to allocate to 
ads, which ads to allocate, and which (nonspon- 
sored) content to display to a user (55). 



The tangle between automated agents and the 
design of rules of interaction also features prom- 
inently in today’s financial markets, where the 
dominance of computerized traders has, by most 
accounts, qualitatively shaped the behavior of 
these markets. Although details of implementa- 
tion are closely held secrets, it is well understood 
that techniques from AI and machine learning 
are widely employed in the design and analysis 
of algorithmic traders (55). Algorithmic trading 
has enabled the deployment of strategies that 
exploit speed advantages and has led in turn to a 
costly arms race of measures to respond to market 
information with minimum latency. A proposed 
design response would replace continuous-time 
auctions with periodic auctions that clear on the 
order of once per second, thus negating the 
advantage of tiny speed improvements (57, 68). 

We describe two additional examples of the 
design of multi-agent systems for an economy 
of AIs. The first example system aggregates 



information held by multiple AIs. The rules of a 
system that achieves this goal can be engineered 
purposefully through the design of a prediction 
market (59). Popular versions of prediction mar- 
kets feature questions such as who will be elected 
U.S. president (e.g., Betfair offers many such mar- 
kets). The basic idea of a prediction market is 
to facilitate trade in securities contracts (e.g., a 
possible contract will pay $1 if Hilary Clinton is 
elected). The price that balances supply and de- 
mand is then interpreted as a market prediction 
(e.g., price $0.60 reflects probability 0.6 for the 
payoff event). 

Consider a domain with a large number of in- 
terrelated random variables— for example, “flight 
BA214 delayed by more than 1 hour,” “snowstorm 
in Boston,” “de-icing machine fail,” “incoming 
flight BA215 delayed by more than 1 hour,” and 
“security alert in London.” In a combinatorial 
prediction market {70), a large bet on the con- 
tract “de-icing machine fail” would affect the 
price of “flight BA214 delayed by more than 1 hour” 
and all other connected events. A challenge is that 
the number of conceivable events is exponential in 
the number of random variables. Among other 
properties, a good market design should allow bets 




Fig. 4. Two generations of sponsored search mechanisms. Early designs were first price (FP), and 
advertisers (ADV) used AIs (AI-POS) to maintain a position on the list of search results at the lowest 
possible price. Second-price (SP) auction mechanisms were introduced, designed to replace the 
combination of FP and AI-POS. Advertisers adopted new AIs (AI-GOAL) to achieve higher-level goals 
such as to maximize profit or to maximize the number of clicks. The second price auction was extended to 
include proxy agents (SP+Proxy), designed to replace the combination of SP and AI-GOAL. 
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Fig. 5. In a reputation system for a multi-agent Al, each agent chooses an action, and the combined 
effect of these actions generates rewards (i.e., utility). Based on the actions taken and the rewards 
received, agent i can submit a report, xi, to the reputation system. The reputation system aggregates this 
feedback— for example, providing a ranked list to reflect the estimated trustworthiness of agents. Each 
agent observes this ranked list, and this information may influence future actions. 



on all events about which AIs have information 
(e.g., “de-icing machine fail AND all subsequent 
flights from Boston delayed by more than 1 hour”). 
A good design should also align incentives— for 
example, making it utility-maximizing to trade im- 
mediately on current information until the market 
price reflects an agent’s belief Progress in scaling 
up combinatorial markets has been made by rela- 
ting the problem of pricing bets to well-understood 
problems in statistical inference and convex opti- 
mization {71, 72). Related research advances are 
being made by allowing AIs to transact in hypo- 
theses that are acquired through machine learning 
as well as trade directly in information signals 
rather than beliefs {73-75). 

The second example is the management of 
information concerning the trustworthiness of 
agents within an economy of AIs. Trust that a 
counterparty will complete a transaction or in- 
vest effort or resources is crucial for any well- 
ftmctioning economic system. A standard approach 
is to associate participants with a reputation, 
which can serve to align incentives in the present 
under the threat of a damaged reputation and 
lost opportunities in the future. In addition to 
this problem of moral hazard (i.e., will agents 
behave cooperatively when completing economic 
transactions), reputation systems can address the 
problem of adverse selection (i.e., will high-quality 
agents choose to enter a market in the first place) 
{76, 77). 

A special challenge in an economy of AIs arises 
because of the fluidity of identity and the ease 
with which agents can be replaced. This raises, 
for example, the specter of whitewashing attacks, 
where an AI repeatedly runs down its reputation 
before reentering with a different identity. With- 
out the possibility of enforcing strong identities 
that cannot be changed, this suggests a social 
cost of fluid identities, where it becomes neces- 



sary to impose a penalty on all new participants 
and make them build up reputations from an 
assumption of being untrustworthy {78). 

We should also consider that machina eco- 
nomicus will be strategic in sharing feedback on 
other AIs. For example, in eBay’s original repu- 
tation system, buyers were often reluctant to 
leave negative feedback about deadbeat sellers, 
because the sellers could retaliate with negative 
feedback about the buyer. In response, eBay 
introduced an additional feedback mechanism 
that was one-directional from the buyer to the 
seller and could not be easily traced to a parti- 
cular buyer. The change resulted in a greater 
amount of negative feedback {79). 

The economy of AIs also offers positive oppor- 
tunities for promoting trust through bookkeeping, 
collecting feedback, and tracking the provenance 
of feedback in novel reputation mechanisms (see 
Fig. 5). AI researchers are designing reputation 
systems that align incentives with making truth- 
ful reports, while provably satisfying axiomatic 
properties such as symmetry: Two agents that 
are in an equivalent position from the perspec- 
tive of reports made and received should have 
the same trust score {80, 81). Another example 
is the design of accounting systems that elicit 
truthful reports about the resources contributed 
or work performed by other AIs and enable the 
design of systems to mitigate free-riding and 
promote fair contributions to an economic sys- 
tem {82). Still, the extent to which effective, multi- 
agent AIs can be developed entirely through 
computational infrastructure such as reputation 
mechanisms and without recourse to legal sys- 
tems remains an interesting open question. 

Closing comments 

Whatever one’s thoughts about when or whether 
AI will transcend human-level performance, the 



rapidly advancing capabilities of AI are fueling 
considerable optimism and investment in AI 
research. AI has surpassed or will likely soon 
surpass humans in narrow domains such as 
playing chess, controlling a jumbo jet during 
cruise, making product recommendations, pric- 
ing millions of products on an eCommerce plat- 
form, reasoning about whether a patient is likely 
to be re-admitted to a hospital, and detecting sig- 
nals from a massive volume of financial news 
stories. 

Certainly, many fundamental challenges re- 
main, including how to design reasoning and 
inference methods that effectively balance the 
benefit of additional computation with the costs 
that may arise from additional delay to acting 
in the world and how to design AI systems that 
can learn and generalize from reward signals 
in unconstrained domains. Given that decision 
problems related to economic transactions are 
often relatively well structured, however, it seems 
likely to us that AI will continue to make espe- 
cially rapid inroads in economically important 
applications. This in turn will ensure continued 
effort on methods for rational, economic reason- 
ing toward the broader goal of developing ma- 
china economicus. 

We should not leave the impression that AI 
researchers unanimously embrace economic per- 
spectives on single- or multi-agent AI. For some, 
multi-agent economic models are still seen as a 
distraction. After all, a centralized perspective 
allows focusing on overall goals without worry- 
ing about the incentives of individual parts of 
the system. Others conduct research into multi- 
agent systems composed of agents under the 
control of the designer, so that they can be pro- 
grammed in any way desired. Just as with cen- 
tralized solutions, these so-called “cooperative” 
multi-agent systems allow design without con- 
cern for the self-interest of individual agents, 
albeit often with decomposition or communi- 
cation constraints. But cooperative versus self- 
interested is really a difference in assumptions 
on the power of a system designer, rather than 
a technical dispute. The viewpoint that we as- 
cribe to is that a large number of AI systems will, 
given the existing structure of human economic 
systems, be populated by AIs that are designed, 
deployed, owned, and operated by a myriad of 
different parties, each with possibly misaligned 
goals. Finally, some may object to the economic 
approach on the basis that AIs are and will re- 
main far from perfectly rational, simply by vir- 
tue of physical and computational limits. More 
direct models of the AIs’ computational behav- 
ior, in terms of the automata they are, could in 
principle be more accurate. The analytical util- 
ity of a rationality abstraction for AIs is ulti- 
mately an empirical question to be resolved as 
AI progresses. 

Among those adopting an economic approach, 
there persist some disagreements on specific 
techniques— for example, on the role of equilib- 
rium reasoning. Even if agents can be viewed 
as rational, some question whether it is plausi- 
ble that they reach equilibrium configurations. 
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particularly in situations where multiple equilib- 
ria exist. As Shoham {83) argues, game theory 
lacks a well-accepted pragmatic account of how 
it should be deployed in concrete reasoning con- 
texts. A positive view is that AI researchers, in 
their efforts to operationalize economic reason- 
ing, are developing exactly this needed body of 
pragmatics. 

Some may object that mechanism design is too 
idealized even for systems of AIs— for example, in 
its insistence on design under equilibrium behav- 
ior, its assumption that rules of interaction can be 
designed from scratch, and its lack of attention to 
the details of the human and legal contexts in 
which designed systems will operate. A positive 
view is that AI systems are precisely the kinds of 
environments where we can build tabula rasa 
new rules of interaction, because these rules will 
be realized through the Internet and as programs 
running on computer servers. That such rules of 
interaction can come into existence is as much a 
matter of science and engineering as it is of 
public policy. 

As AI advances, we are confident that econo- 
mic reasoning will continue to have an impor- 
tant role in the design of single-agent and 
multi-agent AIs, and we have argued that, as 
economies of AIs continue to emerge, there will 
need to be a new science to understand how to 
design these systems. These AIs will no doubt 
exert strong forces on the economy and broader 
society; understanding the effect and extent of 
this will shape the research agendas of both AI 
and economics in years to come. 
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REVIEW 

Computational rationality: A 
converging paradigm for intelligence 
in brains, minds, and machines 

Samuel J. Gershman/* * Eric J. Horvitz,^* Joshua B. Tenenbaum^* 

After growing up together, and mostly growing apart in the second half of the 20th century, 
the fields of artificial intelligence (Al), cognitive science, and neuroscience are 
reconverging on a shared view of the computational foundations of intelligence that 
promotes valuable cross-disciplinary exchanges on questions, methods, and results. 

We chart advances over the past several decades that address challenges of perception 
and action under uncertainty through the lens of computation. Advances include the 
development of representations and inferential procedures for large-scale probabilistic 
inference and machinery for enabling reflection and decisions about tradeoffs in effort, 
precision, and timeliness of computations. These tools are deployed toward the goal of 
computational rationality: identifying decisions with highest expected utility, while 
taking into consideration the costs of computation in complex real-world problems in 
which most relevant calculations can only be approximated. We highlight key concepts with 
examples that show the potential for interchange between computer science, cognitive 
science, and neuroscience. 



I magine driving down the highway on your 
way to give an important presentation, when 
suddenly you see a traffic jam looming ahead. 
In the next few seconds, you have to decide 
whether to stay on your current route or take 
the upcoming exit— the last one for several miles— 
all while your head is swimming with thoughts 
about your forthcoming event. In one sense, this 
problem is simple: Choose the path with the 
highest probability of getting you to your event 
on time. However, at best you can implement 
this solution only approximately: Evaluating the 
full branching tree of possible futures with high 
uncertainty about what lies ahead is likely to be 
infeasible, and you may consider only a few of 
the vast space of possibilities, given the urgency 
of the decision and your divided attention. How 
best to make this calculation? Should you make a 
snap decision on the basis of what you see right 
now, or explicitly try to imagine the next several 
miles of each route? Perhaps you should stop 
thinking about your presentation to focus more 
on this choice, or maybe even pull over so you 
can think without having to worry about your 
driving? The decision about whether to exit has 
spawned a set of internal decision problems: how 
much to think, how far should you plan ahead, 
and even what to think about. 

This example highlights several central themes 
in the study of intelligence. First, maximizing some 
measure of expected utility provides a general- 
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purpose ideal for decision-making under uncer- 
tainty. Second, maximizing expected utility is 
nontrivial for most real-world problems, necessi- 
tating the use of approximations. Third, the choice 
of how best to approximate may itself be a decision 
subject to the expected utility calculus— thinking 
is costly in time and other resources, and some- 
times intelligence comes most in knowing how 
best to allocate these scarce resources. 

The broad acceptance of guiding action with 
expected utility, the complexity of formulating 
and solving decision problems, and the rise of 
approximate methods for multiple aspects of 
decision-making under uncertainty has motivated 
artificial intelligence (AI) researchers to take a 
fresh look at probability through the lens of com- 
putation. This examination has led to the devel- 
opment of computational representations and 
procedures for performing large-scale probabil- 
istic inference; methods for identifying best ac- 
tions, given inferred probabilities; and machinery 
for enabling reflection and decision-making about 
tradeoffs in effort, precision, and timeliness of 
computations under bounded resources. Analo- 
gous ideas have come to be increasingly important 
in how cognitive scientists and neuroscientists 
think about intelligence in human minds and 
brains, often being explicitly influenced by AI re- 
searchers and sometimes influencing them back. 
In this Review, we chart this convergence of ideas 
around the view of intelligence as computational 
rationality: computing with representations, algo- 
rithms, and architectures designed to approximate 
decisions with the highest expected utility, while 
taking into account the costs of computation. We 
share our reflections about this perspective on 
intelligence, how it encompasses interdiscipli- 
nary goals and insights, and why we think it will 
be increasingly useful as a shared perspective. 



Models of computational rationality are built 
on a base of inferential processes for perceiving, 
predicting, learning, and reasoning under uncer- 
tainty ( 1 - 3 ). Such inferential processes operate on 
representations that encode probabilistic depen- 
dencies among variables capturing the likelihoods 
of relevant states in the world. In light of incom- 
ing streams of perceptual data, Bayesian updating 
procedures or approximations are used to prop- 
agate information and to compute and revise 
probability distributions over states of variables. 
Beyond base processes for evaluating probabil- 
ities, models of computational rationality require 
mechanisms for reasoning about the feasibility 
and implications of actions. Deliberation about 
the best action to take hinges on an ability to make 
predictions about how different actions will in- 
fluence likelihoods of outcomes and a considera- 
tion of the value or utilities of the outcomes ( 4 ). 
Learning procedures make changes to parame- 
ters of probabilistic models so as to better explain 
perceptual data and provide more accurate in- 
ferences about likelihoods to guide actions in 
the world. 

Last, systems with bounded computational 
power must consider important tradeoffs in the 
precision and timeliness of action in the world. 
Thus, models of computational rationality may 
include policies or deliberative machinery that 
make inferences and decisions at the “metalevel” 
in order to regulate base-level inferences. These 
decisions rely on reflection about computational 
effort, accuracy, and delay associated with the 
invocation of different base-level algorithms in dif- 
ferent settings. Such metalevel decision-making, 
or “metareasoning,” can be performed via real- 
time reflection or as policies computed during 
offline optimizations. Either way, the goal is to 
identify configurations and uses of base-level pro- 
cesses with the goal of maximizing the expected 
value of actions taken in the world. These com- 
putational considerations become increasingly 
important when we consider richer representa- 
tions (graphs, grammars, and programs) that sup- 
port signature features of human intelligence, such 
as recursion and compositionality (5). 

Key advances in AI on computational machin- 
ery for performing inference, identifying ideal 
actions, and deliberating about the end-to-end 
operation of systems have synergies and reso- 
nances with human cognition. After a brief his- 
tory of developments in AI, we consider links 
between computational rationality and findings 
in cognitive psychology and neuroscience. 

Foundations and early history 

AI research has its roots in the theory of compu- 
tability developed in the 1930s. Efforts then high- 
lighted the power of a basic computing system 
(the Turing Machine) to support the real-world 
mechanization of any feasible computation ( 6 ). 
The promise of such general computation and the 
fast-paced rise of electronic computers fueled the 
imagination of early computer scientists about 
the prospect of developing computing systems 
that might one day both explain and replicate 
aspects of human intelligence (7, 8 ). 
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Fig. 1. Examples of modern Al systems that use approximate inference and decision-making. 

These systems cannot rely on exhaustive enumeration of all relevant utilities and probabilities. Instead, 
they must allocate computational resources (including time and energy) to optimize approximations 
for inferring probabilities and identifying best actions. (A) The internal state of IBM Watson as it 
plays Jeopardy], representing a few high-probability hypotheses. [Photo by permission of IBM News 
Room] (B) The internal state of the Google self-driving car, which represents those aspects of the world 
that are potentially most valuable or costly for the agent in the foreseeable future, such as the po- 
sitions and velocities of the self-driving car, other cars and pedestrians, and the state of traffic signals. 
[Reprinted with permission from Google] (C) The Assistant (left), an interactive automated secretary 
fielded at Microsoft Research, recognizes multiple people in its proximity (right): deliberates about their 
current and future goals, attention, and utterances: and engages in natural dialog under uncertainty. 
[Permission from Microsoft] 



Early pioneers in AI reflected about uses of 
probability and Bayesian updating in learning, 
reasoning, and action. Analyses by Cox, Jaynes, 
and others had provided foundational argu- 
ments for probability as a sufficient measure 
for assessing and revising the plausibility of 
events in light of perceptual data. In influential 
work, von Neumann and Morgenstem published 
results on utility theory that defined ideal, or 
“rational,” actions for a decision-making agent 
(4). They presented an axiomatic formalization 
of preferences and derived the “principle of max- 
imum expected of utility” (MEU). Specifically, 
they showed that accepting a compact and com- 
pelling set of desiderata about preference order- 
ings implies that ideal decisions are those actions 
that maximize an agent’s expected utility, which is 
computed for each action as the average utility 
of the action when considering the probability 
of states of the world. 

The use of probability and MEU decision- 
making soon pervaded multiple disciplines, in- 
cluding some areas of AI research, such as projects 
in robotics. However, the methods did not gain 



a large following in studies of AI until the late 
1980s. For decades after the work by von Neumann 
and Morgenstem, probabilistic and decision- 
theoretic methods were deemed by many in the 
AI research community to be too inflexible, sim- 
plistic, and intractable for use in understand- 
ing and constructing sophisticated intelligent 
systems. Alternative models were explored, in- 
cluding logical theorem-proving and various 
heuristic procedures. 

In the face of the combinatorial complexity 
of formulating and solving real-world decision- 
making, a school of research on heuristic mod- 
els of bounded rationality blossomed in the 
later 1950s. Studies within this paradigm include 
the influential work of Simon and colleagues, 
who explored the value of informal, heuristic 
strategies that might be used by people— as 
well as by computer-based reasoning systems— 
to cut through the complexity of probabilistic 
inference and decision-making (9). The per- 
spective of such heuristic notions of bounded 
rationality came to dominate a large swath of 
AI research. 



Computational lens on probability 

In the late 1980s, a probabilistic renaissance 
swept through mainstream AI research, fueled 
in part by pressures for performing sound in- 
ference about likelihoods of outcomes in appli- 
cations of machine reasoning to such high-stakes 
domains as medicine. Attempts to mechanize 
probability for solving challenges with inference 
and learning led to new insights about proba- 
bility and stimulated thinking about the role of 
related representations and inference strategies 
in human cognition. Perhaps most influentially, 
advances in AI led to the formulation of rich 
network-based representations, such as Bayesian 
networks, broadly referred to as probabilistic 
graphical models (PGMs) (1, 2). Belief updating 
procedures were developed that use parallel and 
distributed computation to update constellations 
of random variables in the networks. 

The study of PGMs has developed in numer- 
ous directions since these initial advances: effi- 
cient approximate inference methods; structure 
search over combinatorial spaces of network struc- 
tures; hierarchical models for capturing shared 
structure across data sets; active learning to 
guide the collection of data; and probabilistic 
programming tools that can specify rich, context- 
sensitive models via compact, high-level pro- 
grams. Such developments have put the notions 
of probabilistic inference and MEU decision- 
making at the heart of many contemporary AI 
approaches (3) and, together with ever-increasing 
computational power and data set availability, 
have been responsible for dramatic AI successes 
in recent years (such as IBM’s Watson, Google’s 
self-driving car, and Microsoft’s automated as- 
sistant). These developments also raise new com- 
putational and theoretical challenges: How can 
we move from the classical view of a rational agent 
who maximizes expected utility over an exhaus- 
tively enumerable state-action space to a theory 
of the decisions faced by resource-bounded AI 
systems deployed in the real world (Fig. 1), which 
place severe demands on real-time computation 
over complex probabilistic models? 

Rational decisions under bounded 
computational resources 

Perception and decision-making incur computa- 
tional costs. Such costs may be characterized in 
different ways, including losses that come with 
delayed action in time-critical settings, interfer- 
ence among multiple inferential components, 
and measures of effort invested. Work in AI has 
explored the value of deliberating at the meta- 
level about the nature and extent of perception 
and inference. Metalevel analyses have been aimed 
at endowing computational systems with the 
ability to make expected utility decisions about 
the ideal balance between effort or delay and 
the quality of actions taken in the world. The use 
of such rational metareasoning plays a central 
role in decision-theoretic models of bounded ra- 
tionality {10-14). 

Rational metareasoning has been explored in 
multiple problem areas, including guiding computa- 
tion in probabilistic inference and decision-making 
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{11, 13, 14), controlling theorem proving (i5), 
handling proactive inference in light of incom- 
ing streams of problems (16), guiding heuristic 
search (13, 17), and optimizing sequences of ac- 
tion (18-20). Beyond real-time metareasoning, 
efforts have explored offline analysis to learn 
and optimize policies for guiding real-time meta- 
reasoning and for enhancing real-time inference 
via such methods as precomputing and caching 
portions of inference problems into fast-response 
reflexes (21). 

The value of metalevel reflection in computa- 
tional rationality is underscored by the complex- 
ity of probabilistic inference in Bayesian networks, 
which has been shown to be in the nondeter- 
ministic polynomial-time (NP)-hard complexity 
class (22). Such worst-case complexity highlights 
the importance of developing approximations 
that exploit the structure of real-world problems. 
A tapestry of approximate inferential methods 
have been developed, including procedures that 
use Monte Carlo simulation, bounding methods, 
and methods that decompose problems into 
simpler sets of subproblems (1). Some methods 
allow a system to trade off computation time 
for accuracy. For example, sampling procedures 
can tighten the bounds on probabilities of inter- 
est with additional computation time. Charac- 
terizations of the tradeoffs can be uncertain in 
themselves. Other approaches to approximation 
consider tradeoffs incurred with modulating the 
complexity of models, such as changing the size 
of models and the level of abstraction of evi- 
dence, actions, and outcomes considered (11, 21). 

A high-level view of the interplay between the 
value and cost of inference at different levels of 
precision is captured schematically in Fig. 2A. 



A 



Here, the value of computing with additional 
precision on final actions and cost of delay for 
computation are measured in the same units of 
utility. A net value of action is derived as the 
difference between the expected value of action 
based on a current analysis and the cost of com- 
putation required to attain the level of analysis. 
In the situation portrayed, costs increase in a 
linear manner with a delay for additional com- 
putation, while the value of action increases with 
decreasing marginal returns. We see the attain- 
ment of an optimal stopping time, in which at- 
tempts to compute additional precision come at 
a net loss in the value of action. As portrayed in 
the figure, increasing the cost of computation 
would lead to an earlier ideal stopping time. In 
reality, we rarely have such a simple economics 
of the cost and benefits of computation. We are 
often uncertain about the costs and the expected 
value of continuing to compute and so must 
solve a more sophisticated analysis of the ex- 
pected value of computation. A metalevel rea- 
soner considers the current uncertainties, the 
time-critical losses with continuing computation, 
and the expected gains in precision of reasoning 
with additional computation. 

As an example, consider a reasoning system 
that was implemented to study computational 
rationality for making inferences and providing 
recommendations for action in time-critical med- 
ical situations. The system needs to consider the 
losses incurred with increasing amounts of delay 
with action that stems from the time required for 
inference about the best decision to take in a 
setting. The expected value of the best decision 
may diminish as a system deliberates about a 
patient’s symptoms and makes inferences about 



B 



physiology. A trace of a reasoning session guided 
by rational metareasoning of a time-critical 
respiratory situation in emergency medicine is 
shown in Fig. 2B (14). An inference algorithm 
(named Bounded Conditioning) continues to 
tighten the upper and lower bounds on a critical 
variable representing the patient’s physiology, 
using a Bayesian network to analyze evidence. 
The system is uncertain about the patient’s state, 
and each state is associated with a different time 
criticality and ideal action. The system continues 
to deliberate at the metalevel about the value of 
continuing to further tighten the bounds. It mon- 
itors this value via computation of the expected 
value of computation. When the inferred ex- 
pected value of computation goes to zero, the 
metalevel analysis directs the base-level system 
to stop and take the current best inferred base- 
level action possible. 

Computational rationality in mind 
and brain 

In parallel with developments in AI, the study 
of human intelligence has charted a similar 
progression toward computational rationality. 
Beginning in the 1950s, psychologists proposed 
that humans are “intuitive statisticians,” using 
Bayesian decision theory to model intuitive 
choices under uncertainty (23). In the 1970s and 
1980s, this hypothesis met with resistance from 
researchers who uncovered systematic fallacies 
in probabilistic reasoning and decision-making 

(24) , leading some to adopt models based on in- 
formal heuristics and biases rather than norma- 
tive principles of probability and utility theory 

(25) . The broad success of probabilistic and 
decision-theoretic approaches in AI over the past 

two decades, however, has helped 
to return these ideas to the center 
of cognitive modeling (5, 26-28). 
The development of methods for 
approximate Bayesian updating via 
distributed message passing over 
large networks of variables suggests 
that similar procedures might be 
used for large-scale probabilistic in- 
ference in the brain (29). At the same 
time, researchers studying human 
judgment and decision-making con- 
tinue to uncover ways in which peo- 
ple’s cognitive instincts appear far 
from the MEU ideals that econo- 
mists and policymakers might have 
hoped for. 

Computational rationality offers 
a framework for reconciling these 
contradictory pictures of human in- 
telligence. If the brain is adapted to 
compute rationally with bounded 
resources, then “fallacies” may arise 
as a natural consequence of this 
optimization (30). For example, a 
generic strategy for approximating 
Bayesian inference is by sampling 
hypotheses, with the sample-based 
approximation converging to the true 
posterior as more hypotheses are 



Value of result in 





Fig. 2. Economics of thinking in computational rationality. (A) Systems must consider the expected value and 
cost of computation. Flexible computational procedures allow for decisions about ideal procedures and stopping 
times (f*) in order to optimize the net value of action (u*). In the general case, the cost-free value associated with 
obtaining a computed result at increasing degrees of precision and the cost of delay with computation are 
uncertain (indicated by the bell curve representing a probability distribution). Thus, the time at which further 
refinement of the inference should stop and action should be taken in the world are guided by computation of the 
expected value of computation. [Adapted from (16) with permission] (B) Trace of rational metareasoning in a 
time-critical medical setting. [Adapted from (14) with permission] A bounding algorithm continues to tighten the 
upper bound (UB) and lower bound (LB) on an important variable representing a patient’s physiology. When a 
continually computed measure of the value of additional computation (red line) goes to zero, the base-level model 
is instructed to make a recommendation for immediate action. 
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Fig. 3. Resource-constrained sampling in human cognition. (A) Incre- 
mental parsing of a garden-path sentence. [Adapted from (36)] (Top) Evo- 
lution of the posterior probability for two different syntactic parses (shown 
in boxes). The initially favored parse is disfavored by the end of the sen- 
tence. (Bottom) A resource-constrained probabilistic parser (based on a 
particle filter with few particles) may eliminate the initially unlikely parse 
and therefore fail to correctly parse the sentence by the end, as shown by 
the proportion of particle filters with 20 particles that successfully parse 
the sentence up to each word. (B) Sample-based inference for multiple- 
object tracking. In this task, subjects are asked to track a subset of dots 
over time, marked initially in red. Lines denote velocity, and circles denote 
uncertainty about spatial transitions. In the second frame, red shading 



indicates the strength of belief that a dot is in the tracked set (More red = 
higher confidence) after some time interval A, for a particle-filter object 
tracker. Uncertainty scales with velocity, explaining why high-velocity ob- 
jects are harder to track (32). (C) For a sampling-based approximate Bayesian 
decision-maker, facing a sequence of binary choices, expected utility per 
decision, and number of decisions per unit time can be combined to 
compute the expected utility per unit time as a function of the number of 
posterior samples and action/sample cost ratios. Circles in the rightmost 
graph indicate the optimal number of samples at a particular action/sample 
cost ratio. For many decisions, the optimal choice tradeoff between accuracy 
and computation time suggests deciding after only one sample. [Reprinted 
from (38) with permission] 



sampled (i). Evidence suggests that humans 
use this strategy across several domains, includ- 
ing causal reasoning {31), perception {32, 33), 
and category learning {34). Sampling algorithms 
can also be implemented in biologically plausible 
neural circuits (35), providing a rational expla- 
nation for the intrinsic stochasticity of neurons. 

We see a correspondence between the sam- 
pling algorithms humans appear to use and those 
used in state-of-the-art AI systems. For example, 
particle filters— sequential sampling algorithms 



for tracking multiple objects moving in a dynamic 
uncertain environment (35)— are at the heart of 
the Google self-driving car’s picture of its sur- 
roundings (Fig. IB) and also may describe how 
humans track multiple objects (Fig. 3B) {32). 
When only a small number of hypotheses are 
sampled, various biases emerge that are con- 
sistent with human behavior. For instance, “gar- 
den path” effects in sentence processing, in 
which humans perseverate on initially promising 
hypotheses that are disconfirmed by subse- 



quent data, can be explained by particle filters 
for approximate online parsing in probabilistic 
grammars (Fig. 3A) (37). These biases may in 
fact be rational under the assumption that sam- 
pling is costly and most gains or losses are small, 
as in many everyday tasks; then, utility can be 
maximized by sampling as few as one or a few 
high-posterior probability hypotheses for each 
decision (Fig. 3C) (38). 

This argument rests crucially on the assertion 
that the brain is equipped with metareasoning 
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mechanisms sensitive to the costs of cognition. 
Some such mechanisms may take the form of 
heuristic policies hardwired by evolutionary 
mechanisms; we call these “heuristic” because 
they would be metarational only for the range 
of situations that evolution has anticipated. There 
is also evidence that humans have more adaptive 
metareasoning mechanisms sensitive to the costs 
of cognition in online computation. In recent 
work with the “demand selection” task (39-41), 
participants are allowed to choose between two 
cognitive tasks that differ in cognitive demand 
and potential gains. Behavioral findings show 
that humans trade off reward and cognitive ef- 
fort rationally according to a joint utility function 
(40). Brain imaging of the demand selection task 
has shown that activity in the lateral prefrontal 
cortex, a region implicated in the regulation of 
cognitive control, correlates with subjective re- 
ports of cognitive effort and individual differences 
in effort avoidance (41). 

Several recent studies have provided support 
for rational metareasoning in human cognition 
when computational cost and reward tradeoffs 
are less obvious (42, 43). As an example, humans 
have been found to consistently choose list- 
sorting strategies that rationally trade time and 
accuracy for a particular list type (42). This study 
joins earlier work that has demonstrated adap- 
tive strategy selection in humans (44, 45) but 
goes beyond them by explicitly modeling strategy 
selection using a measure of the value of com- 
putation. In another study (46), humans were 
found to differentially overestimate the frequency 
of highly stressful life events (such as lethal acci- 
dents and suicide). This “fallacy” can be viewed 
as rational under the assumption that only a small 
number of hypotheses can be sampled: Expected 
utility is maximized by a policy of utility-weighted 
sampling. 

Computational tradeoffs in 
sequential decision-making 

Computational rationality has played an impor- 
tant role in linking models of biological intelli- 
gence at the cognitive and neural levels in ways 
that can be seen most clearly in studies of se- 
quential decision-making. Humans and other 
animals appear to make use of different kinds 
of systems for sequential decision-making: “model- 
based” systems that use a rich model of the 
environment to form plans, and a less complex 
“model-free” system that uses cached values to 
make decisions (47). Although both converge 
to the same behavior with enough experience, 
the two kinds of systems exhibit different trade- 
offs in computational complexity and flexibility. 
Whereas model-based systems tend to be more 
flexible than the lighter-weight model-free sys- 
tems (because they can quickly adapt to changes 
in environment structure), they rely on more ex- 
pensive analyses (for example, tree-search or dy- 
namic programming algorithms for computing 
values). In contrast, the model-free systems use 
inexpensive, but less flexible, look-up tables or 
function approximators. These efforts have con- 
ceptual links to efforts in AI that have sought to 



reduce effort and to speed up responses in real 
time by optimizing caches of inferences via off- 
line precomputation (21). 

Studies provide evidence that model-based and 
model-free systems are used in animal cognition 
and that they are supported by distinct regions 
of the prefrontal cortex (48) and striatum (49). 
Evidence further suggests that the brain achieves 
a balance between computational tradeoffs by 
using an adaptive arbitration between the two 
kinds of systems (50, 51). One way to implement 
such an arbitration mechanism is to view the 
invocation of the model-based system as a meta- 
action whose value is estimated by the model- 
free system (51). 

Early during learning to solve a task, when the 
model-free value estimates are relatively inaccu- 
rate, the benefits of using the model-based sys- 



tem outweigh its cognitive costs. Thus, moderately 
trained animals will be sensitive to changes in 
the causal structure of the environment (for 
example, the devaluation of a food reinforcer by 
pairing it with illness). After extensive training, 
the model-free values are sufficiently accurate 
to attain a superior cost-benefit tradeoff (46). 
This increasing reliance on the model-free system 
manifests behaviorally in the form of “habits”— 
computationally cheap but inflexible policies. 
For example, extensively trained animals will 
continue pursuing a policy that leads to previously 
devalued reinforcers (52). 

The arbitration mechanism described above 
appears to adhere to the principles of compu- 
tational rationality: The model-based system is 
invoked when deemed computationally advan- 
tageous through metareasoning (Fig. 4A). For 
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Fig. 4. Computational tradeoffs in use of different decision-making systems. (A) A fast but inflexible 
model-free system stores cached values in a look-up table but can also learn to invoke a slower but more 
flexible model-based system that uses forward search to construct an optimal plan. The cached value for 
invoking the model-based system (highlighted in green) is a simple form of metareasoning that weighs the 
expected value of forward search against time and effort costs. (B) Hippocampal place cell-firing patterns 
show the brain engaged in forward search at a choice point, sweeping ahead of the animal’s current 
location. Each image shows the time-indexed representation intensity of locations in pseudocolor (red, 
high probability: blue, low probability). The representation intensity is computed by decoding spatial 
location from ensemble recordings in hippocampal area CAS. [Reprinted with permission from (56)] 
(C) Similar principles apply to more complex decision problems, such as Atari and Go (left). AI systems 
(right) use complex value function approximation architectures, such as deep convolutional nets (top 
right) [reprinted with permission from (60)], for model-free control, and sophisticated forward search 
strategies, such as Monte Carlo Tree Search (bottom right) [reprinted with permission from (6J)], for 
model-based control. 
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example, reliance on the model-based system 
decreases when the availability of cognitive 
resources are transiently disrupted {53). Recent 
data show that the arbitration mechanism may 
be supported by the lateral prefrontal cortex (54), 
the same region involved in the registration of 
cognitive demand. 

Finer-grained metareasoning may play a role 
within the richer model-based systems them- 
selves. One way to approximate values is to adapt 
the sampling hypothesis to the sequential deci- 
sion setting, stochastically exploring trajectories 
through the state space and using these sample 
paths to construct a Monte Carlo estimator. Re- 
cently, a class of sampling algorithms known as 
Monte Carlo Tree Search (MCTS) has gained con- 
siderable traction on complex problems by bal- 
ancing exploration and exploitation to determine 
which trajectories to sample. MCTS has achieved 
state-of-the-art performance in computer Go as 
well as a number of other difficult sequential 
decision problems (55). A recent study analyzed 
MCTS within a computational rationality frame- 
work and showed how simulation decisions 
can be chosen to optimize the value of com- 
putation {20). 

There is evidence that the brain might use 
an algorithm resembling MCTS to solve spatial 
navigation problems. In the hippocampus, “place 
cells” respond selectively when an animal is in 
a particular spatial location and are activated 
sequentially when an animal considers two dif- 
ferent trajectories (Fig. 4B) (55). Pfeiffer and Foster 
(57) have shown that these sequences predict an 
animal’s immediate behavior, even for new start 
and goal locations. It is unknown whether for- 
ward sampling observed in place cells balances 
exploration and exploitation as in MCTS, explor- 
ing spatial environments the way MCTS explores 
game trees, or whether they are sensitive to the 
value of computation. These are important stand- 
ing questions in the computational neuroscience 
of decision-making. 

At the same time, AI researchers are beginning 
to explore powerful interactions between model- 
based and model-free decision-making systems 
parallel to the hybrid approaches that computa- 
tional cognitive neuroscientists have investigated 
(Fig. 4C). Model-free methods for game-playing 
based on deep neural networks can, with exten- 
sive training, match or exceed model-based MCTS 
approaches in the regimes that they have been 
trained on {58). Yet, combinations of MCTS and 
deep-network approaches beat either approach 
on its own {59) and may be a promising route to 
explain how human decision-making in complex 
sequential tasks can be so accurate and so fast 
yet still flexible to replan when circumstances 
change— the essence of acting intelligently in an 
uncertain world. 

Looking forward 

Computational rationality offers a potential uni- 
fying framework for the study of intelligence in 
minds, brains, and machines, based on three 
core ideas: that intelligent agents fundamen- 
tally seek to form beliefs and plan actions in 



support of maximizing expected utility; that 
ideal MEU calculations may be intractable for 
real-world problems, but can be effectively ap- 
proximated by rational algorithms that maximize 
a more general expected utility incorporating the 
costs of computation; and that these algorithms 
can be rationally adapted to the organism’s spe- 
cific needs, either offline through engineering 
or evolutionary design, or online through meta- 
reasoning mechanisms for selecting the best 
approximation strategy in a given situation. 
We discussed case studies in which these ideas 
are being fruitfully applied across the disci- 
plines of intelligence, but we admit that a ge- 
nuine unifying theory remains mostly a promise 
for the future. We see great value in pursuing 
new studies that seek additional confirmation 
(or disconfirmation) of the roles of machinery for 
cost-sensitive computation in human cogni- 
tion, and for enabling advances in AI. Although 
we cannot foresee precisely where this road 
leads, our best guess is that the pursuit itself is 
a good bet— and as far as we can see, the best 
bet that we have. 
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Polar bears cannot resist 
summer food shortages 

Whiteman et al., p. 295 
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THERMAL PHYSIOLOGY 

Keeping cool 

S ilver ants inhabit one of the hottest and (driest environments on Earth, the Saharan san(ds, 
where most insects shrivel and die moments after contact. Shi et al. show that the triangular 
shape of the silver hairs that cover their bodies enables this existence. The hairs both increase 
the reflection of near-infrared rays and dissipate heat from the ants’ bodies, even under full 
sun conditions. Evolution’s simple solution to intense heat management in this species could 
lead to better designs for passive cooling of human-produced objects. — SNV 

Science, this issue p. 298 



HIV-l VACCINES 

To defeat SIV, add a 
protein boost 

Despite 30 years of effort, no 
HIV-1 vaccine exists. Barouch 
etal. evaluated one promising 
strategy in rhesus macaques, 
a preclinical model commonly 
used to test potential HIV-1 
vaccine candidates. They 
immunized monkeys with adeno- 
virus-36 vectors engineered 



to express SIV (simian immu- 
nodeficiency virus) genes and 
then boosted them with a 
recombinant gpl20 envelope 
glycoprotein (Env) from SIV. This 
regimen afforded greater protec- 
tion than a strategy that instead 
used a viral vector-based boost. 
A parallel trial using a SHIV (sim- 
ian/human immunodeficiency 
virus)-based vaccine and chal- 
lenge model produced similar 
results. Whether this particular 



approach will be equally suc- 
cessful in humans remains to be 
tested. — KLM 

Science, this issue p. 320 

MAGNETISM 

Skyrmions emerge 
in trilayers 

Skyrmions are tiny whirlpools of 
magnetic spin with potential to 
act as carriers of information in 



future devices. Skyrmions have 
been observed in multiple mate- 
rials but usually at impractically 
low temperatures. Jiang et al. 
used a constriction in a trilayer 
system to create skyrmions 
at room temperature (see the 
Perspective by von Bergmann). 
The authors pushed elongated 
magnetic domains through the 
constriction using an in-plane 
current, causing individual skyr- 
mion bubbles to form. — JS 

Science, this issue p. 283; 

see also p. 234 

NANOPARTICLE IMAGING 

Looking at teeny tiny 
platinum particies 

Electron microscopy is a 
powerful technique for taking 
snapshots of particles or images 
at near-atomic resolution. Park et 
al. studied free-floating platinum 
nanoparticles using electron 
microscopy and liquid cells (see 
the Perspective by Colliex). Using 
analytical techniques developed 
to study biological molecules, 
they reconstructed the three- 
dimensional features of the Pt 
particles at near-atomic resolu- 
tion. This approach has the scope 
to study a mixed population of 
particles one at a time and to 
study their synthesis as it occurs 
in solution. — MSL 

Science, this issue p. 290; 

see also p. 232 

PLANT SCIENCE 

Substrate channeling in 
morphine biosynthesis 

Poppies are still the most 
economically viable source of 
the excellent painkiller mor- 
phine. Winzer et al. have now 
identified a key enzyme in the 
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Tetrahymena uses 
small RNAs to fend 
off transposable 
elements 









GENOME DEFENSE ■ ^ 

Hunter RNAs seek 
and destroy parasitic DMAs 

T ransposable elements (TEs) are parasitic DNA sequences. 
They present a serious threat to the host genomes in 
which they reside. Like other eukaryotes, the single-celled 
protist Tetrahymena uses small RNAs to control parasitic 
TEs. Noto etal. show that a subset of these small RNAs 
recognize the TEs they are synthesized from as foreign, as well 
as sequence-related TEs in other places in the genome. These 
RNAs stimulate the production of a second wave of small RNAs 
at these distant sites. The second-wave, or “late,” RNAs can bind 
to still further TE sequences, ensuring a robust and heritable 
defense of the entire genome. — GR 

Mol. Ce//59,10.1016/j.molcel.2015.05.024(2015). 



poppy’s biosynthetic pathway 
for morphine. The enzyme turns 
out to be an unusual protein that 
contains both cytochrome P-450 
and oxidoreductase modules. 
Together these modules process 
two subsequent steps in the 
biosynthetic pathway. The 
identification of this enzyme 
may enable alternate routes 
for morphine biosynthesis that 
are less dependent on poppy 
cultivation. — PJH 

Science, this issue p. 309 

SEX DETERMINATION 

How germ cells 
become sperm or egg 

During vertebrate development, 
germ cells switch from a sexually 
indifferent to a committed state 
for either egg or sperm. Signals 
from somatic gonadal cells are 
generally thought to influence 
the sexual differentiation of 
germ cells. However, Nishimura 
etal. demonstrate that germ 
cell-intrinsic sex determination 
cues are at play in the teleost 
fish medaka. The forkhead box 
transcriptional factor foxl3 
represses the initiation of sper- 
matogenesis. In the absence of 
foxl3 function, females develop 
ovaries filled with functional 
sperm. Thus, the male gonad 
environment is not required for 
spermatogenesis.— BAP 

Science, this issue p. 328 

ICE SHEETS 

Movers and shakers 

When the edge of an ice sheet 
breaks off and falls into the sea 
(calves), the remaining section 



Icebergs near 
the terminus of 
Helheim Glacier 




of the ice sheet moves backward 
and down and can suffer a glacial 
earthquake. Murray etal. studied 
calving from Greenland’s Helheim 
Glacier. The forces that cause the 
change in the motion of the ice 
sheet at its terminus also trigger 
the accompanying earthquakes. 
Because these seismic signals 
can be detected by instruments 
located all over the globe, it 
should be possible to use these 
glacial earthquakes as proxies for 
glacier calving. — HJS 

Science, this issue p. 305 

DRUG DISCOVERY 

Long-acting drug to 
treat resistant malaria 

Malaria kills 0.6 million people 
annually. Currently available 
drugs are no longer fully effec- 
tive, because the malarial 
parasite has developed resis- 
tance. Now, Phillips etal. have 
identified a drug, DSM265, that 
kills both drug-sensitive and 
drug-resistant parasites by tar- 
geting their ability to synthesize 
precursors required for synthe- 
sis of DNA and RNA. DSM265 
kills parasites in the blood and 
the liver and is sufficiently long- 
acting that it could potentially 
cure malaria after a single dose 
or provide effective chemopre- 
vention if given weekly. — QMS 
Sci. Transl. Med. 7, 296ralll (2015). 

INFECTIOUS DISEASE 

GBS toxin activates mast 
cells for host defense 

Ascending Group B streptococ- 
cus (GBS) is a major cause of 
preterm birth. How the mother 
defends against GBS infec- 
tions is not clear. Working in 
mice, Rajagopal et al. found that 
immune cells, called mast cells, 
are activated by a lipid toxin 
produced by GBS. This lipid toxin 
is an ornithine rhamno-polyene 
that stains the bacteria red. The 
toxin assists penetration of the 
placenta by the bacteria. The 
toxin also stimulates mast cell 
degranulation, an early step in 
host efforts to defeat the bacte- 
rial infection. — PLY 

Sci.Adv. 10.1126/ 
sciadv.l400225(2015). 



ASTROPHYSICS 

X-ray echoes used 
as a cosmic yardstick 

Determining accurate distances 
to astronomical x-ray sources 
is notoriously difficult, but 
Heinz etal. have developed a 
new technique for doing so. 



They exploited “light echoes” 
from the object CirX-1, in which 
x-rays emitted during a large 
flare bounced off foreground 
dust and appeared months later 
as delayed rings of emission 
around the source. By compar- 
ing the x-ray rings to radio data, 
they were able to identify the 
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foreground clouds of dust and 
thereby measure the distance to 
Cir X-1 with an accuracy of 10%. 
Previous estimates had varied 
by almost a factor of 3. — KTS 

Astrophys. J. 806 , 265 (2015). 



PHYSICS 

A gap in a topological 
surface state 

Topological insulators (TIs) 
have, in theory, conducting 
surfaces and insulating bulks. 

In practice, TIs often have a 
considerable bulk conductance 
that masks the conduction from 
the topologically protected 
surface states. To make the 
surface state nonconducting— in 
other words, to open a gap in its 
dispersion— researchers usually 
resort to magnetic doping. Now, 
Weber et al. demonstrate that a 
topological surface state of the 
compound B\^Se^ is naturally, 
although partially, gapped. To 
show this, the authors use a 
combination of photoemission 
measurements and calculations. 
The results may be more general 
within the subclass of TIs to 
which Bi 4 Sc 3 belongs. — JS 

Phys. Rev. Lett 10.1103/ 
PhysRevLett.114.256401 (2015). 



PROTEIN FOLDING 

Trapped on the 
wrong pathway 

Protein folding can be described 
as diffusion over an energy 
landscape in conformational 
space to reach an energy 
minimum that represents a 
stable folded structure, and it is 
implicated in many diseases. A 
particularly dramatic example is 
the prion protein, PrP, that folds 
rapidly into a native monomeric 
structure but also has a stable 
oligomeric form that causes 
prion disease. Yu et al. used 
I single-molecule force spectros- 
^ copy to monitor misfolding of a 
I PrP dimer. The dimer misfolds 
w along a single pathway involv- 
I ing several intermediates, one 
8 of which blocks native folding. 

J Diffusion across the energy 
p landscape is 1000 times slower 
I than for the folding of native 



prion monomers, probably 
indicating that many unproduc- 
tive interactions occur during 
misfolding. — VV 

Proc. Natl. Acad. Sci. U.S.A. 10.1073/ 
pnas.l419197112(2015). 



GLOBAL CARBON CYCLE 

Seeing the forest 
from the trees 

Boreal forests contain nearly 
half of the carbon stored by the 
trees of the world, and thus are a 
critical component of the global 
carbon cycle. How much carbon 
is contained in this part of the 
ecosystem still is inadequately 
known, though. Chen and Luo 
studied the net annual above- 
ground biomass change of four 



major boreal forest types in 
western Canada over the period 
between 1958 and 2011 and 
found that it declined for all of 
those groups. They attribute the 
loss to increased tree mortality 
and reduced growth, caused 
mostly by persistent warming 
and decreasing water availabil- 
ity, trends that are expected to 
escalate in the future. — HJS 

Global Change Biol. 10.1111/ 
gcb.l2994(2015). 



NEUROSCIENCE 

Contextual memory 
networks in monkey brains 

Memory tasks activate a number 
of areas in the prefrontal cortex 
of the brain. Lesions in this region 



cause memory deficits: however, 
lesions in only a fraction of the 
activated areas actually lead 
to severe memory loss. Osada 
etal. measured whole-brain 
activity during a memory task 
in monkeys. The activated areas 
and their task-specific functional 
connectivity formed a hierarchical 
network centered on a hub. This 
functional hub largely matched 
the documented sites where 
lesions had the most dramatic 
effect. Neighboring sites where 
lesions had no impact were much 
less affected. The functional con- 
nections of an area predicted the 
degree of memory loss much bet- 
ter than its role in the anatomical 
network. — PRS 



PLOSBiol. 10.1371/journal. 
pbio.l002177(2015). 






















Angiopoietin-l expression varies 
with brain size in guppies 



BRAIN EVOLUTION 

The genetic underpinnings of brain size 

hat genes and selective processes lead to an increase in brain size and intelligence is one of 
the greatest questions in human evolution. Chen et al. present findings in fish that may help 
us understand this process. When guppies were selected for brain size, expression differ- 
ences were observed in the /\ng/opo/et/n-l (Ang-1) gene. These differences seem to be due 
to noncoding variation: furthermore, lowering /\ng-l expression in the zebrafish also affected 
brain size. On the basis of these results, the authors suggest that further study of Ang-1 may help pro- 
vide insight into the evolution of brain size and cognition in other species, including humans. — LMZ 

Proc. R. Soc. London Sen B 10.1098/rspb.2015.0872 (2015). 
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CHROMOSOMES 

Protein partners for 
chromosome silencing 

Female mammals have two X 
chromosomes, one of which is 
almost completely shut down 
during development. The long 
noncoding Xist RNA plays a role 
in this process. To understand 
how a whole chromosome can 
be stably inactivated, Minaj’igi 
et al. identified many of the 
proteins that bind to the Xist 
RNA, which include cohesins. 
Paradoxically, the interac- 
tion between Xist and cohesin 
subunits resulted in repulsion 
of cohesin complexes from the 
inactive X chromosome, chang- 
ing the three-dimensional shape 
of the whole chromosome. — GR 
Science, this issue p. 282 

PLANT ECOLOGY 

Grassland diversity and 
ecosystem productivity 

The relationship between plant 
species diversity and ecosystem 
productivity is controversial. 

The debate concerns whether 
diversity peaks at intermediate 
levels of productivity— the so- 
called humped-back model— or 
whether there is no clear predict- 
able relationship. Fraser eta/, 
used a large, standardized, and 
geographically diverse sample of 
grasslands from six continents to 
confirm the validity and general- 
ity of the humped-back model. 
Their findings pave the way for a 
more mechanistic understanding 
of the factors controlling species 
diversity. — AMS 

Science, this issue p. 302 

HEAVY FERMIONS 

Probing the insulating 
state of SmBe 

When a metal is subjected to 
a strong magnetic field, its 



electrons start rearranging 
into new energy levels, caus- 
ing its electronic properties to 
oscillate as a function of the 
field. Unexpectedly, Tan et al. 
observed this phenomenon, 
called quantum oscillations, in 
the Kondo insulator samarium 
hexaboride (SmBg), which does 
not conduct electricity. They 
measured the magnetic torque 
and detected quantum oscilla- 
tions originating from the bulk of 
this heavy fermion compound. 
These oscillations had an 
unusual temperature depen- 
dence, which presents another 
puzzle to theorists seeking to 
understand the nature of the 
insulating state of SmB^. — JS 

Science, this issue p. 287 

ANIMAL PHYSIOLOGY 

Not that unusual after all 

As polar ice recedes, polar bears 
are facing a changed habitat 
with reduced summer forag- 
ing opportunities. It has been 
hypothesized that they might 
be able to resist summer food 
shortages by reducing their met- 
abolic needs in a sort of “walking 
hibernation.” Whiteman etal. 
monitored energy expenditure in 
polar bears both on and off the 
ice and found energy reductions, 
but that these were more akin 
to normal mammalian fasting 
levels. Thus, it appears that polar 
bears have no energetic protec- 
tions against reduced summer 
food supplies and will face 
increasing starvation threats if 
summer foraging habitats con- 
tinue to decline. — SNV 

Science, this issue p. 295 

CIRCADIAN RHYTHMS 

Biochemical basis of a 
24-hour clock 

Circadian clocks keep organisms 
in synch with such daily cycles 



as illumination, activity, and food 
availability. The circadian clock in 
cyanobacteria has the necessary 
24-hour period despite its three 
component proteins having bio- 
chemical activities that occur on 
a much faster time scale. Abe et 
al. focused on the cyanobacterial 
clock component KaiC, an ade- 
nosine triphosphatase (ATPase) 
that can autophosphorylate and 
autodephosphorylate. The slow 
ATPase activity of KaiC, which is 
linked to a peptide isomerisation, 
provided the slow kinetics that set 
the speed of the 24-hour clock. 
Chang et al. found that another 
clock component, KaiB, also 
has slow changes in its protein 
conformation that help to set the 
oscillation period of the clock and 
its signaling output. — LBR 

Science, this issue pp. 312 and 324 

ECOLOGY 

The benefits of diversity 

Pathogens and parasites are 
an integral part of all ecosys- 
tems. But the likelihood of 
them causing disease depends 
on environmental factors. In a 
Perspective, Keesing and Ostfeld 
highlight recent studies on 
disease prevalence as a function 
of diversity. In humans, other 
animals, and plants, disease 
prevalence is lower when diver- 
sity is high. This is the case even 
though pathogen diversity is 
higher in more-diverse systems. 
As biodiversity is lost from 
ecosystems, they become more 
vulnerable to infections. — JFU 
Science, this issue p. 235 

INFLAMMATION 

Neutrophil NETs drive 
atheroscierosis 

The buildup of fats, cholesterol, 
and other substances in arteries 
causes atherosclerosis, which 
restricts blood flow and can 



lead to heart attacks and stroke. 
Inflammation contributes to 
the pathogenesis of athero- 
sclerosis, but exactly how is not 
fully understood. Warnatsch 
et al. now show that immune 
cells called neutrophils release 
NETS (neutrophil extracellular 
traps) (see the Perspective by 
Nahrendorf and Swirski). These 
NETS are composed of DNA and 
antimicrobial proteins, and in the 
setting of atherosclerosis they 
activate innate immune signal- 
ing pathways in macrophages. 
This causes the macrophages to 
secrete proinflammatory cyto- 
kines, exacerbating the disease. 
Indirectly, NETS also attract a 
specialized subset of T cells that 
further amplify the proinflam- 
matory response. -KLM 

Sc/ence, this issue p. 316; 

see also p. 237 

IMMUNOLOGY 

Fine-tuning the 
infiammatory response 

The binding of the bacterial 
product lipopolysaccharide 
(EPS) to the receptor TLR4 on 
macrophages triggers inflam- 
matory responses that require 
the transcription factor NE-kB. 
TLR4 recruits the adaptor protein 
MyD88 when the receptor is at 
the plasma membrane and a dif- 
ferent adaptor protein, TRIE, after 
internalization into endosomes. 
Cheng et al. found that MyD88 
was required for the initial peak 
of transient NE-kB activation in 
all LPS-stimulated cells (see also 
the Eocus by Williams etal.). In 
contrast, TRIE was required for 
more sustained NE-kB activation 
in a subset of cells. Thus, macro- 
phages use both adaptor proteins 
to fine-tune NE-kB activation to 
induce an appropriate inflamma- 
tory response. — JEE 

Sci. Signal. 8 , ra69 and fsl3 (2014). 



281-B 17 JULY 2015 • VOL 349 ISSUE 6245 



sciencemag.org SCIENCE 



Published by AAAS 




RESEARCH 



RESEARCH ARTICLE SUMMARY 



CHROMOSOMES 

A comprehensive Xist interactome 
reveals cohesin repulsion and an RNA- 
directed chromosome conformation 



Anand Minajigi,* John E. Froberg,* Chunyao Wei, Hongjae Sunwoo, Barry Kesner, 
David Colognori, Derek Lessing, Bernhard Payer, Myriam Boukhali, 

Wilhelm Haas, Jeannie T. Leef 



INTRODUCTION: The mammal has 

evolved an epigenetic mechanism to 
silence one of two X chromosomes in the 
XX female to equalize gene dosages with 
the XY male. Once established, the inacti- 
vated X chromosome (Xi) is extremely 
stable and is maintained through the 
lifetime of the female mammal. The prin- 
cipal regulator, Xist, is a long noncoding 
RNA that orchestrates the silencing proc- 
ess along the Xi. Xist is believed to 
operate as a scaffold to recruit and spread 
repressive complexes, such as Polycomb 
Repressive Complex 2, along the X chro- 
mosome. The identities of crucial in- 
teracting factors, however, have remained 
largely unknown. 



RATIONALE: Although the Xi’s epigenetic 
stability is a necessary homeostatic prop- 
erty, an ability to unlock this epigenetic 
state is of great current interest. The X 
chromosome is home to nearly 1000 genes, 
at least 50 of which have been implicated in 
X-linked diseases, such as Rett syndrome 
and fragile X syndrome. The Xi is therefore 
a reservoir of functional genes that could be 
tapped to replace expression of a disease allele 
on the active X (Xa). A major gap in current 
understanding is the lack of a comprehensive 
Xist interactome. Progress toward a full 
interactome would advance knowledge of 
epigenetic regulation by long noncoding 
RNA and potentially inform treatment of X- 
linked diseases. 
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An operational model for how Xist RNA orchestrates the Xi state. Xist is a multitasking RNA 
that brings many layers of repression to the Xi. Although Xist RNA recruits repressive complexes 
(such as PRCl, PRC2, DNMTl, macroH2A, and SmcHDl) to establish and maintain the inactive 
state, it also actively repels activating factors and architectural proteins (such as the cohesins and 
CTCF) to avoid acquisition of a transcription-favorable chromatin conformation. 
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Read the full article 
at http://dx.doi. 
org/10.1126/ 
science. aab2276 



RESULTS: We have developed an RNA-centric 
proteomic method called iDRiP (identification 
of direct RNA-interacting proteins). 

Using iDRiP, we identified 80 to 200 pro- 
teins in the Xist interactome. The interactors 
fall into several functional categories, includ- 
ing cohesins, condensins, topoisomerases, RNA 
helicases, chromatin remodelers, histone modi- 
fiers, DNAmethyltransfer- 
ases, nucleoskeletal fectors, 
and nuclear matrix pro- 
teins. Targeted inhibition 
demonstrates that Xi si- 
lencing can be destabil- 
ized by disrupting multiple 
components of the interactome, consistent 
with the idea that these factors synergistical- 
ly repress Xi transcription. Triple-drug treat- 
ments lead to a net increase of Xi expression 
and up-regulation of -100 to 200 Xi genes. We 
then cany out a focused study of X-linked cohe- 
sin sites. Chromatin immunoprecipitation se- 
quencing analysis demonstrates three types of 
cohesin sites on the X chromosome: Xi-specific 
sites, Xa-specific sites, and biallelic sites. We 
find that the Xa-specific binding sites represent 
a default state. Ablating Xist results in restora- 
tion of Xa-specific sites on the Xi. These find- 
ings demonstrate that, while Xist attracts 
repressive complexes to the Xi, it actively repels 
chromosomal architectural factors such as the 
cohesins from the Xi. Finally, we examine 
how Xist and the repulsion of cohesins affect 
Xi chromosome structure. In wild-type cells, 
the Xa is characterized by -112 topologically 
associated domains (TADs) and the Xi by 
two megadomains. Intriguingly, loss of Xist 
and restoration of cohesin binding result in 
a reversion of the Xi to an Xa-like chro- 
mosome conformation. Hi-C analysis shows 
that TADs return to the Xi in a manner cor- 
related with the reappearance of cohesins and 
with a transcriptionally permissive state. 



CONCLUSION: Our Study unveils many layers 
of Xi repression and demonstrates a central 
role for RNA in the topological organization 
of mammalian chromosomes. Our study also 
supports a model in which Xist RNA simulta- 
neously acts as (i) a scaffold for the recruitment 
of repressive complexes to establish and main- 
tain the inactive state and (ii) a repulsion 
mechanism to extrude architectural factors 
such as cohesins to avoid acquisition of a 
transcription-favorable chromatin conforma- 
tion. Finally, our findings indicate that the 
stability of the Xi can be perturbed by targeted 
inhibition of multiple components of the Xist 
interactome. ■ 



The list of author affiliations is available in the full article online. 
*These authors contributed equally to this work. 

fCorresponding author: lee@molbio.mgh.harvard.edu 
Cite this article as A. M. Minajigi et al.. Science 349, aab2276 
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CHROMOSOMES 

A comprehensive Xist interactome 
reveals cohesin repulsion and an 
RNA-directed dmmiosonie conformation 

Anand Minajigi/* * John E. Froberg/* Chunyao Wei/ Hongjae Sunwoo/ Barry Kesner/ 
David Colognori/ Derek Lessing/ Bernhard Payer/f Myriam Boukhali,^ 

Wilhelm Haas/ Jeannie T. Lee^t 

The inactive X chromosome (Xi) serves as a model to understand gene silencing on a 
global scale. Here, we perform “identification of direct RNA interacting proteins” (IDRiP) 
to isolate a comprehensive protein interactome for Xist, an RNA required for Xi 
silencing. We discover multiple classes of interactors— including cohesins, condensins, 
topoisomerases, RNA helicases, chromatin remodelers, and modifiers— that synergistically 
repress Xi transcription. Inhibiting two or three interactors destabilizes silencing. 
Although Xist attracts some interactors, it repels architectural factors. Xist evicts cohesins 
from the Xi and directs an Xi-specific chromosome conformation. Upon deleting Xist, 
the Xi acquires the cohesin-binding and chromosomal architecture of the active X. 

Our study unveils many layers of Xi repression and demonstrates a central role for RNA 
in the topological organization of mammalian chromosomes. 



T he mammalian X chromosome is unique in 
its ability to undergo whole-chromosome 
silencing. In the early female embryo, X- 
chromosome inactivation (XCI) enables 
mammals to achieve gene dosage equiv- 
alence between the XX female and the XY male 
(J-3). XCI depends on Xist RNA, a 17-kb long 
noncoding RNA (IncRNA) expressed only from 
the inactive X chromosome (Xi) (4) and that im- 
plements silencing by recruiting repressive com- 
plexes (5-8). Whereas XCI initiates only once during 
development, the female mammal stably main- 
tains the Xi through her lifetime. In mice, a germ- 
line deletion of Xist results in peri-implantation 
lethality due to a failure of Xi establishment (P), 
whereas a lineage-specific deletion of Xist causes 
a lethal blood cancer due to a failure of Xi mainte- 
nance {10). Thus, both the de novo establishment 
and proper maintenance of the Xi are crucial for 
viability and homeostasis. There are therefore 
two critical phases of XCI: (i) A one-time initia- 
tion phase in peri-implantation embryonic devel- 
opment that is recapitulated by differentiating 
embryonic stem (ES) cells in culture, and (ii) a 
lifelong maintenance phase that persists in all 
somatic lineages. 

Once established, the Xi is extremely stable 
and difficult to disrupt genetically and pharma- 
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cologically {11-13). In mice, X reactivation is pro- 
grammed to occur only twice: once in the blastocyst 
to erase the imprinted XCI pattern and a second 
time in the germ line before meiosis {14, 15). Al- 
though the Xi’s epigenetic stability is a homeo- 
static asset, an ability to unlock this epigenetic 
state is of great current interest. The X chromo- 
some is home to nearly 1000 genes, at least 50 of 
which have been implicated in X-linked diseases, 
such as Rett syndrome and fragile X syndrome. 
The Xi is therefore a reservoir of functional genes 
that could be tapped to replace expression of a 
disease allele on the active X (Xa). A better un- 
derstanding of Xi repression would inform both 
basic biological mechanisms and treatment of 
X-linked diseases. 

It is believed that Xist RNA silences the Xi 
through conjugate protein partners. A major gap 
in current understanding is the lack of a compre- 
hensive Xist interactome. Despite multiple at- 
tempts to define the complete interactome, only 
four directly interacting partners have been iden- 
tified over the past two decades, including PRC2, 
ATRX, YYl, and HNRPU: Polycomb repressive 
complex 2 (PRC2) is targeted by Xist RNA to the 
Xi; the ATRX RNA helicase is required for the 
specific association between Xist and PRC2 
{16, 17); YYl tethers the Xist-PRC2 complex to the 
Xi nucleation center {18); and the nuclear matrix 
factor, HNRPU/SAF-A, enables stable association 
of Xist with the chromosomal territory {19). Many 
additional interacting partners are expected, given 
the large size of Xist RNA and its numerous 
conserved modular domains. We have developed 
an RNA-based proteomic method and implement 
an unbiased screen for Xist’s comprehensive in- 
teractome. We identify a large number of high- 



confidence candidates, demonstrate that it is 
possible to destabilize Xi repression by inhibiting 
multiple interacting components, and then delve 
into a focused set of interactors with the 
cohesins. 

Results 

iDRiP identifies muitipie ciasses of 
Xist- interacting proteins 

A systematic identification of interacting factors 
has been challenging because of Xist’s large size, 
the expected complexity of the interactome, and 
the persistent problem of high background with 
existing biochemical approaches {20). A high 
background could be particularly problematic 
for chemical cross-linkers that create extensive 
covalent networks of proteins, which could in 
turn mask specific and direct interactions. We 
therefore developed iDRiP (identification of direct 
RNA interacting proteins) using the zero-length 
cross-linker, ultraviolet (UV) light, to implement 
an unbiased screen of directly interacting pro- 
teins in female mouse fibroblasts expressing phys- 
iological levels of Xist RNA (Fig. lA). We performed 
in vivo UV cross-linking, prepared nuclei, and 
solubilized chromatin by DNase I digestion. Xist- 
specific complexes were captured using nine com- 
plementary oligonucleotide probes spaced across 
the 17-kb RNA, with a 25-nucleotide probe length 
designed to maximize RNA capture while reduc- 
ing nonspecific hybridization. The complexes were 
washed under denaturing conditions to elimi- 
nate factors not covalently linked by UV to Xist 
RNA To minimize background due to DNA-bound 
proteins, a key step was inclusion of DNase I 
treatment before elution of complexes (see the 
supplementary text). We observed substantial 
enrichment of Xist RNA over highly abundant 
cytoplasmic and nuclear RNAs (U6, Jpx, and 18S 
ribosomal RNA) in eluates of female fibroblasts 
(Fig. IB). Enrichment was not observed in male 
eluates or with luciferase capture probes. Eluted 
proteins were subjected to quantitative mass spec- 
trometry (MS), with spectral counting {21) and 
multiplexed quantitative proteomics {22) yield- 
ing similar enrichment sets (table SI). 

From three independent replicates, iDRiP-MS 
revealed a large Xist protein interactome (Fig. 1C 
and table SI). Recovery of known Xist interac- 
tors, PRC2, ATRX, and HNRPU, provided a first 
validation of the iDRiP technique. Also recovered 
were macrohistone H2A (mH2A), RINGl (PRCl), 
and the condensin component, SmcHDl— proteins 
known to be enriched on the Xi {19, 23, 24) but 
not previously shown to interact directly with Xist. 
More than 80 proteins were found to be >threefold 
enriched over background; >200 proteins were 
>twofold enriched (table SI). In many cases, mul- 
tiple subunits of the epigenetic complex were 
identified, boosting our confidence in them as 
interactors. We verified select interactions by 
performing a test of reciprocity: By baiting with 
candidate proteins in an antibody capture, RIP- 
qPCR (RNA immunoprecipitation-quantitative 
polymerase chain reaction) of UV-cross-linked cells 
reciprocally identified Xist RNA in the pulldowns 
(Fig. ID). Called on the basis of high enrichment 
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values, presence of multiple subunits within a 
candidate epigenetic complex, and tests of reci- 
procity, novel high-confidence interactors fell into 
several functional categories: (i) Cohesin complex 
proteins SMCla, SMC3, RAD21, WAPL, and PDS5a/ 
b, as well as CCCTC-binding factor (CTCF) (25), 
which are collectively implicated in chromosome 
looping (26-28); (ii) histone modifiers such as 
aurora kinase B (AURKB), a serine/threonine ki- 
nase that phosphoiylates histone H3 (29); RINGl, 
the catalytic subunit of polycomb repressive 
complex 1 (PRCl) for H2A-K119 ubiquitylation 
(23); SPEN and RBM15, which associate with 
histone deacetylases; (iii) switch/sucrose non- 
fermentable (SWI/SNF) chromatin remodeling 
factors; (iv) topoisomerases TOP2a, TOP2b, and 
TOPI, which relieve torsional stress during tran- 
scription and DNA replication; (v) miscellaneous 
transcriptional regulators like MYEF2 and ELAVl; 
(vi) nucleoskeletal proteins that anchor chromo- 
somes to the nuclear envelope, SUN2, lamin-B re- 
ceptor (LBR), and LAP2; (vii) nuclear matrix proteins 
hnRPU/SAF-A, hnRPK, and MATRIN3; and (viii) the 
DNA methyltransferase DNMTl, known as a mainte- 
nance methylase for CpG dinucleotides (30). 

To study their function, we first performed 
RNA immunofluorescence in situ hybridization 
(immunoFISH) of female cells and observed sev- 
eral patterns of Xi coverage relative to the sur- 
rounding nucleoplasm (Fig. IE). Like PRC2, RINGl 
(PRCl) has been shown to be enriched on the Xi 
(23) and is therefore not pursued further. TOPI 
and TOP2a/b appeared neither enriched nor de- 
pleted on the Xi (100%, n > 50 nuclei). AURKB 
showed two patterns of localization: pericentric 
enrichment (20%, n > 50) and a more diffuse lo- 
calization pattern (80%, data not shown), consist- 
ent with its cell-cycle-dependent chromosomal 
localization (29). On the other hand, whereas 
SUN2 was depleted on the Xi (100%, n = 52), it 
often appeared as pinpoints around the Xi in 
both day 7 differentiating female ES cells (estab- 
lishment phase; 44%, n = 307) and in fibroblasts 
(maintenance phase; 38.5%, n = 52), consistent 
with SUN2’s function in tethering telomeres to 
the nuclear envelope. Finally, the cohesins and 
SWI/SNF remodelers unexpectedly showed a de- 
pletion relative to the surrounding nucleoplasm 
(100%, 72 = 50 to 100). These patterns suggest 
that the Xist interactors operate in different XCI 
pathways. 

To ask if the factors intersect the PRC2 path- 
way, we stably knocked down (KD) top candidates 
using short-hairpin RNAs (shRNAs) (table S2) and 
performed RNA immunoFISH to examine trimeth- 
yiation of histone H3-lysine 27 (H3K27me3) (Fig. 2, 
A and B). No major changes to Xist localization 
or H3K27me3 were evident in d7 ES cells (fig. SI). 
There were, however, long-term effects in fibro- 
blasts: The decrease in H3K27me3 enrichment in 
shSMARCCl and shSMARCA5 cells (Fig. 2, A and 
B) indicated that SWI/SNF interaction with Xist 
is required for proper maintenance of PRC2 func- 
tion on the Xi. Steady-state Xist levels did not 
change by more than twofold (Fig. 2C) and were 
therefore unlikely to be the cause of the polycomb 
defect. Knockdowns of other factors (cohesins. 



topoisomerases, SUN2, and AURKB) had no ob- 
vious effects on Xist localization and H3K27me3. 
Thus, whereas the SWI/SNF factors intersect the 
PRC2 pathway, other interactors do not overtly 
affect PRC2. 

Xi reactivation via targeted inhibition of 
synergistic interactors 

Given the large number of interactors, we created 
a screen to analyze effects on Xi gene expression. 
We derived clonal fibroblast lines harboring a 
transgenic green fluorescent protein (GFP) reporter 
on the Xi (fig. S2) and shRNAs against Xist in- 
teractors. Knockdown of any one interactor did 
not reactivate GFP by more than fourfold (Fig. 3A, 
shControl + none, and fig. S3A). Suspecting syn- 
ergistic repression, we targeted multiple pathways 
using a combination of drugs. To target DNMTl, 
we employed the small molecule 5'-azacytidine 
(aza) (30) at a nontoxic concentration of 0.3 pM 
[<median inhibitory concentration (IC50)], which 
minimally reactivated GFP (Fig. 3A, shControl + 
aza). To target TOP2a/b (31), we employed eto- 
poside (eto) at 0.3 pM (<IC5o), which also min- 
imally reactivated GFP (Fig. 3A, shControl + eto). 
Combining 0.3 pM aza + eto led to an 80- to 90- 
fold reactivation— a level that was almost half of 
GFP levels on the Xa (Xa-GFP) (Fig. 3A), suggest- 
ing strong synergy between DNMTl and TOP2 
inhibitors. Using aza + eto as priming agents, we 
designed triple-dmg combinations inclusive of 
shRNAs for proteins that have no specific small- 
molecule inhibitors. In various shRNA + aza + 
eto combinations, we achieved up to 230-fold 
GFP reactivation— levels that equaled or exceeded 
Xa-GFP levels (Fig. 3A). The greatest effects were 
observed for combinations using shSMARCC2 
(227x), shSMARCA4 (180x), and shRAD21 (211x). 
shTOPl and shCTCF were also effective (175x and 
154x, respectively). Combinations involving re- 
maining interactors yielded 63 to 94x reactivation. 

We then performed allele-specific RNA sequenc- 
ing (RNA-seq) to investigate native Xi genes. In 
an FI hybrid fibroblast line in which the Xi is of 
Mils musculus (mus) origin and the Xa of Mus 
casteneiis (cas) origin, >600,000 X-linked sequence 
polymorphisms enabled allele-specific calls (32). 
Two biological replicates of each of the most 
promising triple-drug treatments showed good 
correlation (figs. S4 to S6). RNA-seq analysis 
showed reactivation of 75 to 100 Xi-specific genes 
in one replicate (Fig. 3B) and up to 200 in a sec- 
ond replicate (fig. S3B), representing a large frac- 
tion of expressed X-linked genes, considering that 
only -210 X-linked genes have an fragments per 
kilobase of transcript per million mapped reads 
(FPKM) > 1.0 in this hybrid fibroblast line. Heat 
map analysis demonstrated that, for individual 
Xi genes, reactivation levels ranged from 2 to 
80x for various combinatorial treatments (Fig. 
3C). There was a net increase in expression level 
(AFPKM) from the Xi in the triple-drug-treated 
samples relative to the shControl + aza + eto, 
whereas the Xa and autosomes showed no ob- 
vious net increase, thereby suggesting preferen- 
tial effects on the Xi due to targeting synergistic 
components of the Xist interactome. Reactivation 



was not specific to any one Xi region (Fig. 3D). 
Most effective were shRAD21, shSMC3, shSMCla, 
shSMARCA4, shTOP2a, and shAURKB drug com- 
binations. Genic examination confirmed increased 
representation of mus-specific tags (red) relative 
to the shControl (Fig. 3E). Such allelic effects 
were not observed at imprinted loci and other 
autosomal genes (fig. S7), further suggesting 
Xi-specific allelic effects. The set of reactivated 
genes varied among drug treatments, although 
some genes (e.g., Rbbp7, G6pdx, and Fmrl) ap- 
peared more prone to reactivation. Thus, the Xi 
is maintained by multiple synergistic pathways, 
and Xi genes can be reactivated preferentially by 
targeting two or more synergistic Xist interactors. 

Xist interaction ieads to cohesin repuision 

To investigate the mechanism, we focused on 
one group of interactors— the cohesins— because 
they were among the highest-confidence hits 
and their knockdowns consistently destabilized 
Xi repression. To obtain Xa and Xi binding pat- 
terns, we performed allele-specific chromatin im- 
munoprecipitation sequencing (ChIP-seq) for 
two cohesin subunits, SMCla and RAD21, and 
for CTCF, which works together with cohesins 
(28, 33-36). In wild-type cells, CTCF binding was 
enriched on Xa (cas) but also showed a number 
of Xi (mus)-specific sites (Fig. 4A) (25, 36). Allelic 
ratios ranged from equal to nearly complete Xa 
or Xi skewing (Fig. 4A). For the cohesins, 1490 
SMCla and 871 RAD21 binding sites were mapped 
onto the X chromosome in total, of which allelic 
calls could be made on -50% of sites (Fig. 4, B and 
C). Although the Xa and Xi each showed signi- 
ficant cohesin binding, Xa-specific sites greatly 
outnumbered Xi-specific sites. For SMCla, 717 sites 
were called on Xa, of which 589 were Xa-specific; 
203 sites were called on Xi, of which 20 were Xi- 
specific. For RAD21, 476 sites were called on Xa, of 
which 336 were Xa-specific; 162 sites were called 
on Xi, of which 18 were Xi-specific. Biological 
replicates showed similar trends (fig. S8, A and B). 

Cohesin’s Xa preference was unexpected in 
light of Xist’s physical interaction with cohesins— 
an interaction suggesting that Xist might recruit 
cohesins to the Xi. We therefore conditionally 
ablated Xist from the Xi (Xi^®^) and repeated 
ChIP-seq analysis in the Xi^^VXa^ fibroblasts 
(37). Surprisingly, xi^®^ acquired 106 SMCla 
and 48 RAD21 sites in cis at positions that were 
previously Xa-specific (Fig. 4, C and D). Biolog- 
ical replicates trended similarly (figs. S8 and S9). 
In nearly all cases, acquired sites represented a 
restoration of Xa sites, rather than binding to 
random positions. By contrast, sites that were 
previously Xi-specific remained intact (Fig. 4, C 
and E, and fig. S8B), suggesting that they do not 
require Xist for their maintenance. The changes 
in cohesin peak densities were Xi-specific and sig- 
nificant (Fig. 4F). Cohesin restoration occurred 
throughout Xi^®^, resulting in domains of bi- 
allelic binding (Fig. 4G and figs. SIO to S12), and 
often favored regions that harbor genes that es- 
cape XCI (e.g., Bgn) (38, 39). There were also shifts 
in CTCF binding, more noticeable at a locus- 
specific level than at a chromosomal level (Fig. 4, 
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Fig. 1. iDRiP-MS reveals a large Xist interactome. (A) iDRiP schematic. 
(B) RT-qPCR demonstrated the specificity of Xist pulldown by iDRiP. Xist and 
control luciferase probes were used for pulldown from UV-cross-linked female 
and control male fibroblasts. Efficiency of Xist pulldown was calculated by 
comparing to a standard curve generated using 10-fold dilutions of input. Mean ± 
standard error (SE) of three independent experiments shown. P determined 
by Student’s t test. (C) Selected high-confidence candidates from three 



biological replicates are grouped into functional classes. Additional candi- 
dates are shown in table SI. (D) UV-RIP-qPCR validation of candidate in- 
teractors. Enrichment calculated as percentage input, as in (B). Mean ± SE of 
three independent experiments shown. P determined by Student’s t test. 
(E) RNA immunoFISH to examine localization of candidate interactors 
(green) in relation to Xist RNA (red). Immortalized MEF cells are tetraploid 
and harbor two Xi. 
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A and G), suggesting that CTCF and cohesins do 
not necessarily track together on the Xi. The ob- 
served dynamics were X chromosome-specific 
and were not observed on autosomes (fig. S13). 
To determine whether there were restoration 
hotspots, we plotted restored SMCla and RAD21 
sites (Fig. 4H, purple) on xi^®^ and observed 
clustering within gene-rich regions. We conclude 
that Xist does not recruit cohesins to the Xi- 
specific sites. Instead, Xist actively repels cohe- 
sins in cis to prevent establishment of the Xa 
pattern. 

Xist RNA directs an Xi-specific 
chromosome conformation 

Cohesins and CTCF have been shown to facili- 
tate formation of large chromosomal domains 
called TADs (topologically associated domains) 
(27, 28, 34, 35, 40-42). The function of TADs is 
currently not understood, because TADs are largely 



invariant across development. However, X-linked 
domains are exceptions to this rule and are 
therefore compelling models to study function of 
topological structures (43-46). By carrying out 
allele-specific Hi-C, we asked whether cohesin 
restoration altered the chromosomal architec- 
ture of Xi^®^. First, we observed that, in wild- 
type cells, our TADs called on autosomal contact 
maps at 40-kb resolution resembled published 
composite (nonallelic) maps (27) (Fig. 5A, bottom). 
Our X chromosome contact maps were also con- 
sistent, with TADs being less distinct due to a 
summation of Xa and Xi reads in the composite 
profiles (Fig. 5A, top). Using the 44% of reads 
with allelic information, our allelic analysis 
yielded high-quality contact maps at 100-kb 
resolution by combining replicates (Fig. 5B and 
fig. S14A) or at 200-kb resolution with a single 
replicate. In wild-type cells, we deduced 112 TADs 
at 40-kb resolution on the X chromosome using the 



method of Dixon et at. (27). We attempted TAD 
calling for the Xi on the 100-kb contact map but 
were unable to obtain obvious TADs, suggesting 
that the 112 TADs are present only on the Xa. The 
Xi instead appeared to be partitioned into two 
megadomains at the DXZ4 region (fig. S14A) (46). 
Thus, although the Xa is topologically organized 
into structured domains, the Xi is devoid of such 
megabase-scale structures across its full length. 

When Xist was ablated, however, TADs were 
restored in cis, and the Xi reverted to an Xa-like 
conformation (Fig. 5B and fig. SMB). In mutant 
cells, ~30 TADs were gained on xi^®^ in each 
biological replicate. Where TADs were restored, 
^Axist pattej-ns (red) became nearly identical to 
those of the Xa (blue), with similar interaction 
frequencies. These regions now bore little 
resemblance to the Xi of wild-type cells (Xi^, 
orange). Overall, the difference in the average in- 
teraction scores between Xi^ and xi^^®^ was 
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Fig. 2. Effect of depleting Xist interactors on H3K27me3. (A) RNA 

immunoFISH of Xist (red) and H3K27me3 (green) after shRNA KD of inter- 
actors in fibroblasts (tetraploid; two Xist clouds). KD efficiencies (fraction 
remaining): SMCla-0.48, SMC3-0.39, RAD21-0.15, AURKB-0.27, TOP2b- 
0.20,TOP2a-0.42,TOPl-0.34, CTCF-0.62, SMARCA4-0.52, SMARCA5-0.18, 
SMARCCl-0.25, SMARCC2-0.32, SMARCBl-0.52, and SUN2-0.72. Some fac- 




tors are essential: therefore, high-percentage KD may be inviable. All images 
are presented at the same photographic exposure and contrast. (B) Quan- 
titation of RNA immunoFISH results, n, sample size. Percentages of aberrant 
Xist/H3K27me3 associations are shown. (C) RT-qPCR of Xist levels in KD fi- 
broblasts, normalized to shControls. Means ± SD of two independent exper- 
iments are shown. 



aab2276-4 17 July 2015 • vol 349 issue 6245 



sciencemag.org SCIENCE 





















RESEARCH | RESEARCH ARTICLE 




0 Allele-specific RNA-seq 

Xi-specific upregulation 
“ Upregulated on Xi and Xa 




DQi-C'jC'jnsconSLLTi- 

QCO 030 ^^1-0 
31- QCCOS coOOQC 
<^C0 



CO 



■ shRNA + aza + eto ■■ 



CO 



shRNA + aza + eto ■ 



log2 FC 



C\J 

O < 

m o ^ < o 

(Mii;-,-DCc\jCOT-c\,LLQC 

zo:d_<qooq_o< 
33 o s S SOI-S 

C0< I- CO QC CO COI-OCO 
wwwwwcowwww 






8.00 

5.33 

2.67 

0.00 

■ 2.67 

- 5.33 

- 8.00 



Refseq genes 
shAURKB 
shSUN2 
shSMARCA4 
shSMARRC2 
shTOPI 
shTOP2a 
shCTCF 
shRAD21 
shSMCIa 
shSMC3 



Bgn Tmem47 Pet2 Lasll 




ShControl . , 










k 




i. 


1 


[0-6] 


shRAD21 












k 


k 


1 




shSMCIa ..... . 










1 


H 


L 


i 




shTOP2a ..... . 














i 


k 




shControl 


















““ [cF2] 


shRAD21 




















shSMCIa 










J 










shTOP2a 












- 









Dkc1 



Q. ; 

E : 
o ; 



O i 



shControl k. .a . . 

shRAD21 . a A i 

ShCTCF -miki—.*. .i ... .A . 
shControl 

shRAD21 -.1 

ShCTCF . M ■ 



G6pdx 



Q. 

E 



shControl 

shRAD21 

shSMC3 

shSMARCA4 



i 

i 

M 

i 



shControl - 
shRAD21 
shSMC3 
shSMARCA4 . 



1 i i L .. r 

I L I i . ii. 

i I i L . -A Ml- 

L 1 I k .i HL. 



Eiflax 



shControl 


i 


i 


A 


1 




i 


li 1 


J 


... 


shTOP2a 


I 


1 


_ _ i 


1 


1 


1 


li 1 


J. .. 


.. 1- l«L-. 


ShAURKB 


1. 


_l 


_ _ _ 1 


1 


i 


1 


li i 


.. J_ . 


.. -1. 1.. 


ShCTCF 


1 


i 


. _ _ 1 


1 


i 


i 


li i 


L_ 




shControl 
















J 


[0-2] 


shTOP2a 














i 


1 


L .. 


ShAURKB 
















J 


L _ 


ShCTCF 


I” 




1 


1 








I 





Rbbp7 



Fig. 3. Derepression of Xi genes by targeting Xist interactors. (A) Relative 
GFP levels by RT-qPCR analysis in female fibroblasts stably knocked down for 
indicated interactors ± 0.3 |iM 5'-azacytidine (aza) ± 0.3 |iM etoposide (eto). Xa- 
GFR control male fibroblasts with X-linked GFP. Means ± SE of two independent 
experiments are shown. R determined by Student’s t test. (B) Allele-specific RNA- 
seq analysis: Number of up-regulated Xi genes for each indicated triple-drug 
treatment (aza + eto -f shRNA). Blue, genes specifically reactivated on Xi [fold 
change (FC) > 2]; red, genes also up-regulated on Xa (FC > 1.3). (C) RNA-seq heat 



map indicating that a large number of genes on the Xi were reactivated. X-linked 
genes reactivated in at least one of the triple-drug treatment (aza + eto -f shRNA) 
were shown in the heat map. Color key, Log2 FC. Cluster analysis performed based 
on similarity of KD profiles (across) and on the sensitivity and selectivity of various 
genes to reactivation (down). (D) Chromosomal locations of Xi reactivated genes 
(colored ticks) for various aza + eto -f shRNA combinations. (E) Read coverage of 
four reactivated Xi genes after triple-drug treatment. Xi, mus reads (scale, 0 to 2). 
Comp, total reads (scale, 0 to 6). Red tags appear only in exons with SNPs. 
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highly significant (Fig. 5C and fig. S15A). Inter- 
secting TADs with SMCla sites on xi^^ revealed 
that 61 restored cohesin sites overlapped restored 
TADs (61 did not overlap). In general, restored 
cohesin sites occurred both within TADs and at 



TAD borders. TADs overlapping restored peaks 
had larger increases in interaction scores relative 
to all other TADs (Fig. 5D and fig. S15B), and we 
observed an excellent correlation between the 
restored cohesin sites and the restored TADs 



(Fig. 5E and fig. S15C), consistent with a role of 
cohesins in reestablishing TADs after Xist deletion. 
Taken together, these data uncover a role for RNA 
in establishing topological domains of mammalian 
chromosomes and demonstrate that Xist must 
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and (AXist) fibre- 
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(KS) test. (B) Differences 
between SMCla or RAD21 
peaks on the Xi^^ versus 
Xa^^. Black diagonal, 1:1 
ratio. Plotted are read 
counts for all normalized 
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Allele-specific skewing is defined as >threefold skew toward either Xa (cas, blue 
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read counts: black diagonal, 1:1 Xi^^'^VXa^^ ratio: red diagonal, 1:2 ratio. (E) Xi- 
specific SMCla or RAD21 peaks remained on Xi^^'^^ Plotted are read counts 
for SMCla or RAD21 peaks with >threefold skew toward Xi'^^ (“Xi-specific”). 
(F) Comparison of fold changes for CTCF, RAD21, and SMCl binding in 
cells relative to WT cells. Shown are fold changes for Xi versus Xa.The Xi showed 
significant gains in RAD21 and SMCla binding, but not in CTCF binding. Method: 
X^^and ChIP samples were normalized by scaling to equal read counts. 
Fold changes for Xi were computed by dividing the normalized mus read count in 
XAXist count X^^: fold changes for Xa were computed by dividing 

the normalized cas read count in by the cas read count X^^. To eliminate 
noise, peaks with <10 allelic reads were eliminated from analysis. P values were 
determined by a paired Wilcoxon signed rank test. (G) The representative 
examples of cohesin restoration on Xi'^^'^^ Arrowheads, restored peaks. 
(H) Allelic-specific cohesin binding profiles of Xa, Xi^^, and Xi^^'^^ Shown 
below restored sites are regions of Xi reactivation after shSMCla and shRAD21 
triple-drug treatments, as defined in Fig. 3. 
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Fig. 5. Ablating Xist A 
results in XI rever- 
sion to an Xa-like 
chromosome confor- 
mation. (A) Chris and 
X chromosome contact 
maps showing triangu- 
lar domains representa- 
tive of TADs. Purple 
shades correspond to 
varying interaction fre- 
quencies (dark, greater 
interactions). TADs 
called from our 
composite (nonallelic) 

HiC data at 40-kb reso- 
lution (blue bars) are 
highly similar to those 
(gray bars) called previ- 
ously (27). (B) Allele- 
specific HiC-seq 
analysis: Contact maps 
for three different 
X chromosome regions 
at 100-kb resolution 
comparing Xi^^'^* (red) 
to Xi^ (orange), and 

XiAXist Xa 

(blue) of the mutant cell 
line. Our TAD calls are 
shown with reference 
sequence (RefSeq) 
genes. (C) Fraction of 
interaction frequency 
per TAD on the Xi (mus) 
chromosome. The 
positions of our TAD 
borders were rounded 
to the nearest 100 kb, 
and submatrices were 
generated from all pixels 
between the two end 
points of the TAD border 
for each TAD. We 
calculated the average 
interaction score for 
each TAD by summing 
the interaction scores 
for all pixels in the sub- 
matrix defined by a TAD 
and dividing by the total 
number of pixels in the 
TAD. We then averaged 
the normalized interac- 
tion scores across all 
bins in a TAD in the Xi (mus) and Xa (cas) contact maps and computed the 
fraction of averaged interaction scores from mus chromosomes. The X chro- 
mosome and a representative autosome, Chr5, are shown for the WT cell line and 
the Xist^^'^V-i- cell line. Rvalues were determined by paired Wilcoxon signed rank 
test. (D) Violin plots showing that TADs overlapping restored peaks have larger 
increases in interaction scores relative to all other TADs. We calculated the fold 
change in average interaction scores on the Xi for all X-linked TADs and 
intersected the TADs with SMCla sites (Xi^^'^VXi^). Thirty-two TADs occurred 
at restored cohesin sites: 80 TADs did not overlap restored cohesin sites. Violin plot 
shows distributions of fold change average interaction scores between Xi'^and 
Xi^xist p^glyg 3 determined by Wilcoxon ranked sum test. (E) Restored TADs 



overlap regions with restored cohesins on across Xi^^'^^ Several data sets were 
used to call restored TADs, each producing similar results. Restored TADs were 
called in two separate replicates (Repl and Rep2) where the average interaction 
score was significantly higher on Xi^^'^* than on Xi^. We also called restored TADs 
based on merged Repl -r Rep2 data sets. Finally, a consensus between Repl and 
Rep2 was derived. Method: We calculated the fold change in mus or cas for all 
TADs on the X chromosome and on a control, Chr5, then defined a threshold for 
significant changes based on either the autosomes or the Xa. We treated Chr5 as 
a null distribution (few changes expected on autosomes) and found the fraction of 
TADs that crossed the threshold for several thresholds. These fractions corre- 
sponded to a FDR for each given threshold. An FDR of 0.05 was used. 
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actively and continually repulse cohesins from the 
Xi, even during the maintenance phase, to prevent 
formation of an Xa chromosomal architecture. 

Discussion 

Using iDRiP, we have identified a comprehen- 
sive Xist interactome and revealed multiple syn- 
ergistic pathways to Xi repression (Fig. 6). With 
Xist physically contacting 80 to 250 proteins at 
any given time, the Xist ribonucleoprotein parti- 
cle may be as large as the ribosome. Our study 
supports a model in which Xist RNA simulta- 
neously acts as (i) scaffold for the recruitment of 
repressive complexes (such as PRCl, PRC2, ATRX, 
mH2A, and SmcHDl) to establish and maintain 
the inactive state; and as (ii) a repulsion mecha- 
nism to extrude architectural factors such as 
cohesins to avoid acquisition of a transcription- 
favorable chromatin conformation. Without Xist, 
cohesins return to their default Xa binding state. 
Repulsion could be based on eviction, with Xist 
releasing cohesins as it extrudes them, or on se- 
questration, with Xist sheltering cohesins to pre- 
vent Xi binding. Our study shows that the Xi 
harbors three types of cohesin sites: (i) Xi-specific 
sites that do not depend on Xist; (ii) biallelic 
sites that are also Xist-independent; and (hi) Xa- 
specific sites, many of which cannot be estab- 
lished on the Xi because of active repulsion by 
Xist. The type (i) and type (iii) sites likely explain 
the paradoxical observations that, on the one hand, 
depleting cohesins leads to Xi reactivation but, 
on the other, loss of Xist-mediated cohesin recruit- 
ment leads to an Xa-like chromosome conforma- 
tion that is permissive for transcription. In essence, 
modulating the type (i) and type (iii) sites both 
have the effect of destabilizing the Xi, rendering 
the Xi more accessible to transcription. Disrupt- 
ing type (i) sites by cohesin knockdown would 
change the repressive Xi structure, while ablating 
Xist would restore the type (iii) sites that promote 
an Xa-like conformation. Our study has focused 
on cohesins, but RNA-mediated repulsion maybe 
an outcome for other Xist interactors and maybe 
as prevalent an epigenetic mechanism as RNA- 
mediated recruitment (47). 

The robustness of Xi silencing is demonstrated 
by the observation that we destabilized the Xi 
only after pharmacologically targeting two or 
three distinct pathways. The fact that the triple- 
drug treatments varied with respect to reacti- 
vated loci and depth of derepression creates the 
possibility of treating X-linked disease in a locus- 
specific manner by administering unique drug 
combinations. Given the existence of many other 
disease-associated IncRNAs, the iDRiP technique 
could be applied systematically toward identify- 
ing new drug targets for other diseases and gen- 
erally for elucidating mechanisms of epigenetic 
regulation by IncRNA. 

Materials and methods 
Identification of Direct RNA interacting 
Proteins (iDRiP) 

Mouse embryonic fibroblasts (MEFs) were irra- 
diated with UV light at 200 mJ energy (Stratagene 
2400) after rinsing with phosphate-buffered 
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saline (PBS). The pellets were resuspended in 
cytoskeleton buffer with 0.5% Triton X-100 
(CSKT)-0.5% [10 mM piperazinediethanesulfonic 
acid, pH 6.8, 100 mM NaCl, 3 mM MgCfy 0.3 M 
sucrose, 0.5% Triton X-100, 1 mM phenylmethyl- 
sulfonyl fluoride (PMSF)] for 10 min at 4°C fol- 
lowed by a spin. The pellets were again resuspended 
in nuclear isolation buffer (10 mM Tris pH 7.5, 
10 mM KCl, 0.5% Nonidet-P 40, lx protease in- 
hibitors, 1 mM PMSF), and rotated at 4°C for 10 min 
(optional step). The pellets were collected after 
a spin, weighed, flash frozen in liquid nitrogen, 
and stored at -80°C until use. 

Approximately equal amounts of female and 
male UV cross-linked pellets were thawed and 
resuspended for treatment with Turbo DNase I 
in the DNase I digestion buffer (50 mM Tris pH 
7.5, 0.5% Nonidet-P 40, 0.1% sodium lauroyl sar- 
cosine, lx protease inhibitors, Superasein). The 
tubes were rotated at 37°C for 45 min with inter- 
mittent mixing or vortexing. The nuclear lysates 
were further solubilized by adding 1% sodium 
lauroyl sarcosine, 0.3 M lithium chloride, 25 mM 
EDTA, and 25 mM EGTA to final concentrations. 
After brief vortexing, continue incubation at 37°C 
for 15 min. The lysates were mixed with biotiny- 
lated DNA probes (table S3) prebound to the 
streptavidin magnetic beads (MyOne streptavidin 
Cl Dyna beads, Invitrogen) and incubated at 
55°C for 1 hour before overnight incubation at 
37°C in the hybridization chamber. The beads 
were washed three times in wash buffer (10 mM 
Tris, pH 7.5, 0.3 M LiCl, 1% LDS, 0.5% Nonidet-P 
40, lx protease inhibitor) at room temperature 
followed by treatment with Turbo DNase I in 
DNase I digestion buffer with the addition of 
0.3 M LiCl, protease inhibitors, and superasein at 
37°C for 20 min. Then, beads were resuspended 
and washed two more times in the wash buffer. 



For MS analysis, elution was done in elution 
buffer (10 mM Tris, pH 7.5, 1 mM EDTA) at 70°C 
for 4 min followed by brief sonication in Covaris. 
For the quantification of pulldown efficiency, 
MEFs, without cross-linking, were used and elu- 
tion was done at 95°C. The elute was used for 
RNA isolation and reverse transcription qPCR 
(RT-qPCR). When cross-linked MEFs were used, 
elute was subjected for proteinase-K treatment 
(50 mM Tris pH 7.5, 100 mM NaCl, 0.5% SDS, 10 
pg proteinase K) for 1 hour at 55°C. RNA was 
isolated by Trizol and quantified with SYBR 
green qPCR. Input samples were used to make 
standard curve by 10-fold dilutions, to which the 
RNA pulldown efficiencies were compared and 
calculated. The efficiency of Xist pulldown was 
relatively lower after UV cross-linking, similar to 
(48, 49). 

Quantitative proteomics 

Proteins co-enriched with Xist from female or 
male cells were quantitatively analyzed either 
using a label-free approach based on spectral 
counting (21) or by multiplexed quantitative pro- 
teomics using tandem-mass tag (TMT) reagents 
(50, 51) on an Orbitrap Fusion mass spectrometer 
(Thermo Scientific). Disulfide bonds were reduced 
with ditheiothreitol (DTT) and free thiols alkylated 
with iodoacetamide as described previously (22). 
Proteins were then precipitated with tricholoracetic 
acid, resuspended in 50 mM HEPES (pH 8.5) and 
1 M urea, and digested first with endoproteinase 
Lys-C (Wako) for 17 hours at room temperature 
and then with sequencing-grade tiypsin (Promega) 
for 6 hours at 37°C. Peptides were desalted over 
Sep-Pak C 18 solid-phase extraction (SPE) cartridges 
(Waters), and the peptide concentration was de- 
termined using a bicinchoninic acid (BCA) assay 
(Thermo Scientific). For the label-free analysis. 




Fig. 6. The Xi is suppressed by multiple synergistic mechanisms. Xist RNA (red) suppresses the 
Xi by either recruiting repressive factors (e.g., PRCl and PRC2) or expelling architectural factors (e.g., 
cohesins). 
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peptides were then dried and resuspended in 5% 
formic acid (FA) and 5% acetonitrile (ACN), and 
5 pg of peptides were analyzed by mass spec- 
trometry as described below. For the multiplexed 
quantitative analysis, a maximum of 50 pg of pep- 
tides were labeled with one out of the available 
TMT-lOplex reagents (Thermo Scientific) (50). 
To achieve this, peptides were dried and resus- 
pended in 50 pi of 200 mM HEPES (pH 8.5) and 
30% (ACN), and 10 pg of the TMT in reagent in 
5 pi of anhydrous ACN was added to the solution, 
which was incubated at room temperature (RT) 
for 1 hour. The reaction was then quenched by 
adding 6 pi of 5% (w/v) hydroxylamine in 200 mM 
HEPES (pH 8.5) and incubated for 15 min at RT. 
The labeled peptide mixture was then subjected 
to a fractionation using basic pH reversed-phase 
liquid chromatography (bRPLC) on an Agilent 
1260 Infinity high-performance liquid chroma- 
tography (HPLC) system equipped with an Agilent 
Extend-C18 column (4.6 x 250 mm; particle size, 
5 pm), basically as described previously {52). Pep- 
tides were fractionated using a gradient from 
22 to 35% ACN in 10 mM ammonium bicarbo- 
nate over 58 min at a flow rate of 0.5 ml/min. 
Fractions of 0.3 ml were collected into a 96-well 
plate to then be pooled into a total 12 fractions 
(A1 to A12, B1 to B12, etc.) that were dried and 
resuspended in 8 pi of 5% FA and 5% ACN, 3 of 
which were analyzed by microcapillaiy liquid 
chromatography tandem mass spectrometry on an 
Orbitrap Fusion mass spectrometer, and a recently 
introduced multistage (MS3) method was used to 
provide highly accurate quantification {53). 

The mass spectrometer was equipped with 
an EASY-nLC 1000 integrated autosampler and 
HPLC pump system. Peptides were separated 
over a 100-pm inner diameter microcapillary col- 
umn in-house packed with first 0.5 cm of Magic 
C4 resin (5 pm, 100 A, Michrom Bioresources), 
then with 0.5 cm of Maccel Cig resin (3 pm, 200 A, 
Nest Group) and 29 cm of GP-C18 resin (1.8 pm, 
120 A, Sepax Technologies). Peptides were eluted 
applying a gradient of 8 to 27% ACN in 0.125% 
formic acid over 60 min Qabel-free) and 165 min 
(TMT) at a flow rate of 300 nl/min. For label-free 
analyses, we applied a tandem-MS method where 
a full-MS spectrum [MSI; mass-to-charge ratio 
{m/z) 375 to 1500; resolution 6 x 10^; automated 
gain control (AGC) target, 5 x 10^; maximum 
injection time, 100 ms] was acquired using the 
Orbitrap, after which the most abundant peptide 
ions where selected for linear ion trap CID-MS2 
in an automated fashion. MS2 scans were done 
in the linear ion trap using the following set- 
tings: quadrupole isolation at an isolation width 
of 0.5 Th; fragmentation method, CID; AGC tar- 
get, 1 X 10^; maximum injection time, 35 ms; nor- 
malized collision energy, 30%). The number of 
acquired MS2 spectra was defined by setting the 
maximum time of one experimental cycle of MSI 
and MS2 spectra to 3 s (top speed). To identify 
and quantify the TMT-labeled peptides, we ap- 
plied a synchronous precursor selection MS3 meth- 
od {22, 53, 54) in a data-dependent mode. The 
scan sequence was started with the acquisition of 
a full MS or MSI spectrum acquired in the Orbi- 



trap {m/z range, 500 to 1200; other parameters 
were set as described above), and the most in- 
tense peptide ions detected in the full MS spec- 
trum were then subjected to MS2 and MS3 analysis, 
while the acquisition time was optimized in an 
automated fashion (top speed, 5 s). MS2 scans 
were performed as described above. Using syn- 
chronous precursor selection, the 10 most abun- 
dant fragment ions were selected for the MS3 
experiment after each MS2 scan. The fragment 
ions were further fragmented using the higher- 
energy collisional dissociation (HCD) fragmentat- 
ion (normalized collision energy, 50%), and the 
MS3 spectrum was acquired in the Orbitrap (re- 
solution, 60,000; AGC taiget, 5 x lO'^; maximum 
injection time, 250 ms). 

Data analysis was performed on an in-house 
generated SEQUEST-based (55) software platform. 
RAW files were converted into the mzXML format 
using a modified version of ReAdW.exe. MS2 
spectra were searched against a protein sequence 
database containing all protein sequences in the 
mouse UniProt database (downloaded 4 February 
2014), as well as that of known contaminants 
such as porcine trypsin. This target component 
of the database was followed by a decoy com- 
ponent containing the same protein sequences 
but in flipped (or reversed) order (55). MS2 spec- 
tra were matched against peptide sequences, with 
both termini consistent with trypsin specificity 
and allowing two missed trypsin cleavages. The 
precursor ion miz tolerance was set to 50 parts 
per million; TMT tags on the N terminus and on 
lysine residues (229.162932 Da, only for TMT analy- 
ses), as well as carbamidomethylation (57.021464 
Da) on cysteine residues were set as static mod- 
ification; and oxidation (15.994915 Da) of methio- 
nines was set as variable modification. Using the 
target-decoy database search strategy (55), a spec- 
tra assignment false discovery rate (FDR) of less 
than 1% was achieved through using linear dis- 
criminant analysis, with a single discriminant score 
calculated from the following SEQUEST search 
score and peptide sequence properties: mass de- 
viation, XCorr, dCn, number of missed trypsin 
cleavages, and peptide length (57). The probability 
of a peptide assignment to be correct was cal- 
culated using a posterior error histogram, and 
the probabilities for all peptides assigned to a 
protein were combined to filter the data set for 
a protein FDR of less than 1%. Peptides with 
sequences that were contained in more than one 
protein sequence from the UniProt database were 
assigned to the protein with the most matching 
peptides (57). 

For a quantitative estimation of protein con- 
centration using spectral counts, we counted the 
number of MS2 spectra assigned to a given pro- 
tein (table SI). TMT reporter ion intensities were 
extracted as that of the most intense ion within a 
0.03 Th window around the predicted reporter 
ion intensities in the collected MS3 spectra. Only 
MS3 with an average signal-to-noise value of larger 
than 28 per reporter ion, as well as with an iso- 
lation specificity {22) of larger than 0.75, were con- 
sidered for quantification. Reporter ions from all 
peptides assigned to a protein were summed to 



define the protein intensity. A two-step normal- 
ization of the protein TMT intensities was per- 
formed by first normalizing the protein intensities 
over all acquired TMT channels for each protein 
based to the median average protein intensity 
calculated for all proteins. To correct for slight 
mixing errors of the peptide mixture from each 
sample, a median of the normalized intensities 
was calculated from all protein intensities in each 
TMT channel, and the protein intensities were 
normalized to the median value of these median 
intensities. 

UV RIP 

The protocol followed is similar to the one de- 
scribed in {18). Briefly, MEFs were cross-linked 
with UV light at 200 mJ and collected by scrap- 
ing in PBS. Cell pellets were resuspended in 
CSKT-0.5% for 10 min at 4°C followed by a spin. 
The nuclei were resuspended in the UV RIP buf- 
fer [PBS buffer containing 300 mM NaCl (total), 
0.5% Nonidet-P 40, 0.5% sodium deoxycholate, 
200 U Protector RNase Inhibitor and lx protease 
inhibitors] with Turbo DNase I 30 U/IP for 
30 min at 37°C. Supernatants were collected af- 
ter a spin and incubated with 5 pg specific anti- 
bodies prebound to 40 pi protein-G magnetic 
beads (Invitrogen) at 4°C overnight. Beads were 
washed three times with cold UV RIP buffer. The 
beads were resuspended in 200 pi Turbo DNase I 
buffer with 20 U Turbo DNase, SuperaseIN, lx 
protease inhibitors for 30 min at 37°C. The beads 
were resuspended and washed three more times 
in the UV RIP washing buffer containing 10 mM 
EDTA The final three washes were given after 
threefold dilution of UV RIP washing buffer. The 
beads were resuspended in 200 pi proteinase-K 
buffer with 10 pg proteinase-K and incubated at 
55°C for 1 hour. RNA was isolated by Trizol, and 
pulldown efficiencies were calculated by SYBR 
qPCR using input for the standard curve. 

Generation of Xi-TgGFP clonal fibroblasts 

Xi-TgGFP (68-5-11) tail-tip fibroblasts (ITF) were 
initially derived from a single female pup, a 
daughter of a cross between slM. castaneus male 
and a M. musculus female, homozygous for an 
X-linked GFP transgene driven by a strong, ubi- 
quitous promoter {58). The fibroblasts were im- 
mortalized by SV40 transformation, and clonal 
lines were derived from individual GFP-negative 
cells selected by fluorescence-activated cell sort- 
ing. In our experience, occasional clones with 
undetectable GFP expression nevertheless have 
the transgene located on the active X chromosome. 
Thus, we confirmed the GFP transgene location 
on the inactive X for the particular clone used 
here, 68-5-11 (see fig. S2). 

Generation of stable KD of Xi-TgGFP TTF 
and 16.7 ES cells 

The protocol is as described in http://www. 
broadinstitute.oig/mai/public/resources/protocols 
A cocktail of three shRNA viruses was used for 
infections (table S2) followed with puromycin 
selection. In all the experiments, nonclonal knock- 
down cells were used. 
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Assay for the reactivation of Xi-TgGFP 

About 125,000 to 150,000 Xi-TgGFP (68-5-11) cells 
were plated along with control (shControl) cells 
treated with dimethyl sulfoxide or stable KD 
cells treated with 0.3 pM azacytidine and 0.3 pM 
Etoposide for 3 days in six well plates. RNA was 
isolated by Trizol twice, with an intermittent 
TurboDNase treatment after the first isolation 
for 30 min at 37°C. One pg RNA was used for 
each of the RT+ and RT- reactions (Superscript 
III, Invitrogen) followed by the SYBR green 
qPCR using the primers listed in table S3, with 
annealing temperature of 60°C for 45 cycles. The 
relative efficiency of Xi-TgGFP reactivations was 
calculated by comparing to U1 snRNA as the 
internal control. 

immunoFiSH 

Cells were grown on coverslips, rinsed in PBS, 
pre-extracted in 0.5% CSKT on ice, and washed 
once in CSK, followed by fixation with 4% para- 
formaldehyde in PBS at room temperature. After 
blocking in 1% bovine serum albumin in PBS for 
20 min supplemented with 10 mM ribonucleoside- 
vanadyl complex (VRC) (New England Biolabs) 
and RNase inhibitor (Roche), incubation was 
carried out with primary antibodies (table S3) 
at room temperature for 1 hour. Cells were 
washed three times in PBST-0.02% Tween-20. 
After incubating with secondary antibody at 
room temperature for 30 min, cells were washed 
three times by PBS/0.02% Tween-20. Cells were 
fixed again in 4% paraformaldehyde and dehy- 
drated in ethanol series. RNA FISH was per- 
formed using a pool of Cy3B- or Alexa 568-labeled 
Xist oligonucleotides for 4 to 6 hours at 42°C in a 
humid chamber. Cells were washed three times 
in 2X SSC, and nuclei were counter-stained by 
Hoechst 33342. Cells were observed under Nikon 
90i microscope equipped with 60X/1.4 N.A. ob- 
jective lens, Orca ER charge-coupled device camera 
(Hamamatsu), and Volocity software (Perkin Elmer). 
Xist RNA FISH probes, a set of total 37 oligo- 
nucleotides with 5' amine modification (IDT), were 
labeled with NHS-Cy3B (GE Healthcare) overnight 
at room temperature followed by ethanol precip- 
itation. In the case of confirmation of Xi-TgGFP 
cells, probes were made by nick translation of a 
GFP PCR product with Cy3-dUTP and of a plas- 
mid containing the first exon of the mouse Xist 
gene, with FITC-dUTP. 

Aiieiic ChiP-seq 

Allele-specific ChIP-seq was performed accord- 
ing to the method of Kung et al (25), in two 
biological replicates. To increase available read 
depth, we pooled together two technical repli- 
cates for Xi^^VXa^ Rad21 replicate 1 sequenced 
on a 2 X 50 bp HiSeq. 2500 rapid run, and we 
also pooled two technical replicates of wild-type 
Rad21 replicate 1, one sequenced on a HiSeq. 2 x 
50 bp run and one on a MiSeq. 2 x 50 bp run. All 
other libraries were sequenced on using 2 x 50 bp 
HiSeq. 2500 rapid runs. To visualize ChIP bind- 
ing signal, we generated fragments per million 
(fpm)-normalized bigWig files from the raw ChIP 
read counts for all reads (comp), mus-specific 



(mus), and cas-specific reads separately. For Smcla, 
CTCF, and Rad21, peaks were called using macs2 
with default settings. To generate consensus peak 
sets for all three epitopes, peaks for the two wild- 
type and Xi^^/Xa^ replicates were pooled, and 
peaks present in at least two experiments were 
used as the common peak set. To make compar- 
isons between allelic read counts between differ- 
ent experiments, we defined a scaling factor as 
the ratio of the total read numbers for the two 
experiments and multiplied the allelic reads for 
each peak in the larger sample by the scaling fac- 
tor. We plotted the number of reads on Xi vs Xa in 
wild-type for all peaks on the X-chromosome to 
determine whether there is a general bias toward 
binding to the Xa or the Xi. To evaluate allelic 
skew on an autosome, we generated plots of mus 
read counts versus cas read counts for all peaks on 
chromosome 5 from 1 to 140,000,000. We used 
this particular region of chromosome 5 because 
XiAxi®t/xa^ is not fully hybrid, and this is a large 
region of an autosome that is fully hybrid based 
on even numbers of read counts from input and 
from our Hi-Cs over this region in Xi^^VXa^ 
(data not shown). To identify peaks that are 
highly Xa-skewed in wild-type but bind substan- 
tially to the Xi in Xi^fyXa^ (restored peaks), for 
Xa-skewed peaks in wild-type, we plotted nor- 
malized read counts on Xi in Xi^®VXa^ versus 
read counts on Xa in wild-type. We defined 
restored peaks as peaks that are (i) more than 3X 
Xa-skewed in wild-type, (ii) have at least five 
allelic reads in wild-type, and (iii) exhibit nor- 
malized read counts on Xi in Xi^^VXa^ that 
are at least half the level of Xa in wild-type. This 
threshold ensures that all restored peaks have at 
least a 2X increase in binding to the Xi in Xi^®V 
Xa^ relative to wild-type. We identified restored 
peaks using these criteria in both replicates of 
Smcla and Rad21 ChIP separately, and to merge 
these calls into a consensus set for each epitope, 
we took all peaks that met criteria for restoration 
in at least one replicate and had at least 50% 
wild-type Xa read counts on Xi in Xi^^^VXa^ 
in both replicates. 

Aiieie-specific RNA-seq 

Xi-TgGFP TTFs (68-5-11) with the stable knock- 
down of candidates were treated with 5 -azacytidine 
and etoposide at 0.3 pM each for 3 days. Strand- 
specific RNA-seq, the library preparation, deep 
sequencing, and data analysis was followed as 
described in (59). Two biological replicates of 
each drug treatment were produced. All libraries 
were sequenced with Illumina HisEq. 2000 or 
2500 using 50 cycles to obtain paired-end reads. 
To determine the allelic origin of each sequenc- 
ing read from the hybrid cells, reads were first 
depleted of adaptors dimers and PCR duplicates, 
followed by the alignment to custom mus/129 and 
cas genomes to separate mus and cas reads. After 
removal of PCR duplicates, -90% of reads were 
mappable. Discordant pairs and multimapped 
reads were discarded. Reads were then mapped 
back to reference mm9 genome using Tophat 
V2.0.10 (-g 1-no-coverage-search-read-edit-dist 
3-read-mismatches 3-read-gap-length 3-b2-very- 



sensitive-mate-inner-dist 50-mate-std-dev 50- 
libraiy-type fr-firststrand), as previously described 
(25, 32, 59). After alignment, gene expression 
levels within each library were quantified using 
Homer v4.7 (rna mm9 -count genes -strand + 
-noadj -condenseGenes) (59), and the normalized 
differential expression analyses across samples 
were performed by using EdgeR (60). 

Hi-C iibrary preparation and anaiysis 

Hi-C libraries were generated according to the 
protocol in Lieberman-Aiden et al. (61). Two bio- 
logical replicate libraries were prepared for 
wild-type and Xi^'^VXa^ fibroblasts each. We 
obtained 150 to 220 million 2 x 50 bp paired-end 
reads per library. The individual ends of the 
read-pairs were aligned to the mus and cas refe- 
rence genomes separately using novoalign with 
default parameters for single-end alignments, and 
the quality score of the alignment was used to 
determine whether each end could be assigned 
to either the mus or the cas haplotype (62). The 
single-end alignments were merged into a Hi-C 
summary file using custom scripts. Reads were 
filtered for self-ligation events and short fragments 
Gess than 1.5X the estimated insert length) likely 
to be random shears using Homer (59, 63). Hi-C 
contact maps were generated using Homer. “Comp” 
maps were made from all reads. ‘XI” and ‘Xa” 
reads were from reads where at least one read- 
end could be assigned to either the mus or cas 
haplotype, respectively. A small fraction of reads 
(-5% of all allelic reads) aligned such that one end 
aligned to mus, the other to cas. These “discor- 
dant” reads were excluded from further analysis, 
because they are likely to be noise arising due to 
random ligation events and/or improper single- 
nucleotide polymorphism (SNP) annotation 
(46, 64). All contact maps were normalized using 
the matrix balancing algorithm of Knight and 
Ruiz (65), similar to iterative correction (46, 66), 
using the MATLAB script provided at the end of 
their paper. We were able to generate robust con- 
tact maps using the comp reads in one replicate 
at 40-kb resolution, but because only -44% of 
reads align allele-specifically, we were only able 
to generate contact maps for the cas and mus 
haplotypes at 200 kb. To increase our resolution, 
we pooled together both biological replicates and 
analyzed the comp contact map at 40-kb reso- 
lution and the mus and cas contact maps at 100 kb. 
We called TADs at 40 kb on the X chromosome 
Chr5, and Chrl3 using the method of Dixon et al. 
(27). Specifically, we processed the normalized 
comp 40-kb contact maps separately into a vec- 
tor of directionality indices using DI_from_matrix. 
pi with a bin size of 40,000 and a window size of 
200,000. We used this vector of directionality 
indices as input for the HMM_calls.m script, and 
after HMM_generation, we processed the HMM 
and generated TAD calls by passing the HMM 
output to file_ends_cleaner.pl, converter_7col.pl, 
hmm_probablity_correcter.pl, hmm-state_caller. 
pi, and, finally, hmm-state_domains.pl. We used 
parameters of min = 2, prob = 0.99, binsize = 
40,000 as input to the HMM probability correc- 
tion script. 



aab2276-10 17 july 2015 • vol 349 issue 6245 



sciencemag.org SCIENCE 



RESEARCH | RESEARCH ARTICLE 



To create a general metric describing interac- 
tion frequencies within TADs at resolution avail- 
able in the allele-specific interaction maps, for 
each TAD, on the X chromosome and Chr5 we 
averaged the normalized interaction scores for 
all bins within each TAD, excluding the main 
diagonal. To make comparisons between inter- 
action frequency over TADs between the cas (Xa) 
and mus (Xi) haplotypes at the resolution avail- 
able with our current sequencing depth, we de- 
fend the ‘Traction mus” as the average interaction 
score for a TAD in the mus contact map divided 
by the sum of the average interaction scores in the 
mus and cas contact maps. 

To discover TADs that show significantly in- 
creased interaction frequency in Xi^®VXa^, we 
generated a null distribution of changes in aver- 
age normalized interaction scores for all TADs 
on chromosome 5, 1 to 140 Mb using the cas and 
mus contact maps. We reasoned that there would 
be few changes in interaction frequency on an 
autosome between the mus or cas contact maps 
for wild-type and Xi^®VXa^; thus, the distri- 
bution of fold changes in interaction score on an 
autosome constitutes a null distribution. Using 
this distribution of fold changes allowed us to 
calculate a threshold fold change for an empir- 
ical FDR of 0.05, and all TADs that had a greater 
increase in average normalized interaction score 
on Xi between wild-type and Xi^’^VXa^ were 
considered restored TADs. We performed this 
analysis of restored TADs separately in each bio- 
logical replicate using the 200-kb contact maps 
to generate interaction scores over TADs and using 
the combined data at 100-kb resolution. 
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Blowing magnetic skyrmion bubbles 



Wanjim Jiang/ Pramey Upadhyaya,^ Wei Zhang/ Guoqiang Yu,^ 

M. Benjamin Jungfleisch/ Frank Y. Fradin/ John E. Pearson/ Yaroslav Tserkovnyak/ 
Kang L. Wang,^ Olle Heinonen/’^’^’® Suzanne G. E. te Velthuis/ Axel Hoffinann^* 

The formation of soap bubbles from thin films is accompanied by topological transitions. 
Here we show how a magnetic topological structure, a skyrmion bubble, can be generated 
in a solid-state system in a similar manner. Using an inhomogeneous in-plane current 
in a system with broken inversion symmetry, we experimentally “blow” magnetic skyrmion 
bubbles from a geometrical constriction. The presence of a spatially divergent spin-orbit 
torque gives rise to instabilities of the magnetic domain structures that are reminiscent of 
Rayleigh-Plateau instabilities in fluid flows. We determine a phase diagram for skyrmion 
formation and reveal the efficient manipulation of these dynamically created skyrmions, 
including depinning and motion. The demonstrated current-driven transformation from stripe 
domains to magnetic skyrmion bubbles could lead to progress in skyrmion-based spintronics. 



M agnetic skyrmions are topological spin 
textures that can be stabilized by Dzyalo- 
shinskii-Moriya interactions (DMIs) {1-9) 
in chiral bulk magnets such as MnSi, 
FeGe, etc. Owing to their distinct vortex- 
like spin texture, sl^rmions exhibit many fesdnating 
features, including emergent electromagnetic 
fields, which enable their efficient manipulation 
{4, 5, 8-10). A particularly technologically inte- 
resting property is that skyrmions can be driven 
by a spin-transfer torque mechanism at a very 
low current density, which has been demonstrated 
at cryogenic temperatures (4, 5, 8, 10). Besides bulk 
chiral magnetic interactions, the interfacial sym- 
metry breaking in heavy metal/ultrathin ferro- 
magnet/insulator (HM/F/I) trilayers introduces 
an interfacial DMI (11-14) between neighboring 
atomic spins, which stabilizes Neel walls (cycloi- 
dal rotation of the magnetization direction) with 
a fixed chirality over the Bloch walls (spiral rota- 
tion of the magnetization direction) (15-20). This 
is expected to result in the formation of skyrmions 
with a ‘Tiedgehog” configuration (14, 18, 21-25). 
This commonly accessible material system ex- 
hibits spin Hall effects from heavy metals with 
strong spin-orbit interactions (26), which in turn 
give rise to well-defined spin-orbit torques (SOTs) 
(17, 19, 27-29) that can control magnetization dy- 
namics efficiently. However, it has been experi- 
mentally challenging to use the electric current 
and/or its induced SOTs (8, 21, 23, 24, 27, 30-32) 
for dynamically creating and/or manipulating 
hedgehog skyrmions. Here we address that issue. 
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Central to this work is how electric currents 
can manipulate a chiral magnetic domain wall 
(DW); that is, the chirality of the magnetization 
rotation (as shown in Fig. lA) is identical for 
every DW. This fixed chirality is stabilized by 
the interfacial DMI (17-19, 21, 28). In HM/F/I 
heterostructures, the current flowing through 
the heavy metal generates a transverse vertical 
spin current attributable to the spin Hall effect 
(27), which results in spin accumulation at the 
interface with the ferromagnetic layer. This spin 
accumulation gives rise to a SOT acting on the 
chiral DW (Fig. lA). The resultant effective spin 
Hall field can be expressed as (17-19, 27) 

Bsh X (z xJe)] (1) 

where rh is the magnetization unit vector, z is 
the unit vector normal to the film plane, and 
is the direction of electron particle flux. Here 
can be written as (fi/‘l\e\) • (GghJc/^f^s)? where 
^/2 is the spin of an electron (and h is Planck’s 
constant h divided by 2n), e is the charge of an 
electron, tf is the thickness of the ferromagnetic 
layer, and Ms is the saturation (volume) magneti- 
zation. The spin Hall angle 0sh =Js/Jc is de- 
fined by the ratio between spin current density 
(Js) and charge current density (Jc). For homo- 
geneous current flow along the x axis (Fig. IB), a 
chiral SOT enables efficient DW motion (17-19). 
In the case of a stripe domain with a chiral DW 
(Fig. IB), the symmetry of Eq. 1 leads to a vanish- 
ing torque on the side walls parallel to the cur- 
rent, and therefore only the end of the stripe 
domain is moved. If the opposite end is pinned, 
this results in an elongation of the stripe. 

The situation becomes more complex when the 
stripe domain is subjected to an inhomogeneous 
current flow. This can be achieved by introducing 
a geometrical constriction into a current-carrying 
trilayer wire (Fig. 1C). Such a constriction results 
in an additional current component along the y 
axis: jy around the narrow neck (Fig. ID). The total 
currents is spatially convergent (or divergent) to 




Fig. 1. Schematic of the transformation of stripe 
domains into magnetic skyrmion bubbles. (A) 

Infinitesimal section of a chiral DW in a ferromagnet 
(F)/heavy metal (HM) bilayer illustrating the relation 
between local magnetization vectors and the SOT- 
induced chiral DW motion of velocity V/jw in a device 
with a homogeneous electron current flow along 
the -t-x axis. Blue corresponds to upward orientation 
of magnetization, whereas orange represents the down- 
ward orientation of magnetization. The bottom panel 
illustrates the magnetization directions inside of the 
Neel wall. (B) Top view of a trilayer device. The blue 
region is a stripe-shaped domain. Light blue arrows 
show the in-plane magnetization direction of the DW 
[as shown in the bottom panel of (A)] and indicate 
that the domain has left-handed chirality. The red 
arrows correspond to the current distribution. (C) 
Introducing a geometrical constriction into the de- 
vice gives rise to an inhomogeneous current distrib- 
ution, which generates a flow along the y axis Oy) 
around the narrow neck. This current distribution is 
spatially divergent to the right and convergent to 
the left of the constriction. The y component of the 
current distribution is highlighted in (D). This intro- 
duces an effective spin Hall force (F^^) along the 
y axis that (E) locally expands the stripe domain on 
the right side. (F) Once the expansion approaches 
a critical point, the resultant restoring forces (Fres) 
associated with the surface tension of the DWs are 
no longer able to maintain the shape, and the stripe 
domains break into circular bubble domains, result- 
ing in the formation of synthetic Neel skyrmions. 
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the left (or right) of the constriction {33), Conse- 
quently, inhomogeneous effective forces (F^^) on 
the DWs (caused by the spin Hall field) are created 
along the y axis; these forces act to expand the end 
of the domain (Fig. IE). As the domain end contin- 
ually expands its radius, the surface tension in 
the DW (resulting from the increasing DW energy 
determined by the combination of exchange and 
anisotropy fields) increases {34), which results in 
breaking the stripes into circular domains (Fig. IF). 

This process resembles how soap bubbles de- 
velop out of soap films upon blowing air through 
a straw, or how liquid droplets form in fluid flow 
jets (35). Because of the interfacial DMI in the 
present system, the spin structures of the newly 
formed circular domains maintain a well-defined 
Geft-handed) chirality {13, 14, 23, 24). Once formed, 
these created synthetic hedgehog (Neel) skyrmions 
{14, 23) are stable due to topological protection 
and move very efficiently following the current 
direction, a process that can be described on the 
basis of a modified Thiele equation {36). The dy- 
namic skyrmion conversion could, in principle, 
happen at the other side of the device, where the 
spatially convergent current compresses stripe 
domains. However, sizeable currents and SOTs are 
required to compensate the enhanced (repulsive) 
dipolar interaction. The proposed mechanism dif- 
fers from a recent theoretical proposal with sim- 
ilar geometry, in which skyrmions are formed 
from the coalescence of two independent DWs 
extending over the fiill width of a narrow constric- 
tion at a current density ~10^ A/cm^ {32). For 
repeated skyrmion generation, this latter mech- 
anism requires a continuous generation of paired 
DWs in the constriction, which is inconsistent with 
the experimental observations described below. 

Transforming chiral stripe domains 
into skyrmions 

We demonstrated this idea experimentally with 
a Ta(5 nm)/Co 2 oFe 6 oB 2 o(CoFeB)(l.l nm)/Ta0^3 nm) 
trilayer grown by magnetron sputtering {37, 38) 
and patterned into constricted wires via photo- 
lithography and ion milling {33). The wires have 
a width of 60 pm with a 3-pm-wide and 20-pm-long 
geometrical constriction in the center. Our devices 
are symmetrically designed across the narrow neck 
to maintain balanced demagnetization energy. A 
polar magneto-optical Kerr effect (MOKE) micro- 
scope in a differential mode {39) was used for dy- 
namic imaging experiments at room temperature. 
Before applying a current, the sample was first sat- 
urated at positive magnetic fields and subsequently 
a perpendicular magnetic field of = +0.5 mT 
was applied; sparse magnetic stripe and bubble 
domains prevail at both sides of the wire (Fig. 2A). 
The lighter area corresponds to positive perpendic- 
ular magnetization orientation, and the darker area 
corresponds to negative orientation, respectively. 

In contrast to the initial magnetic domain config- 
uration, after passing a 1-s single pulse of current den- 
sity je = +5 X 10^ A/cm^ (normalized by the width of 
the device: 60 pm), it is observed that the stripe do- 
mains started to migrate, subsequently forming 
extended stripe domains on the left side. These do- 
mains were mostly aligned with the chaige current 



flow and converged at the left side of constriction. 
The stripes were transformed into slq^mion bubbles 
immediately after passing through the constric- 
tion (Fig. 2B). These dynamically created sl^amions, 
varying in size between 700 nm and 2 pm (depend- 
ing on the strength of the external magnetic field), 
are stable and do not decay on the scale of a typical 
laboratory testing period (at least 8 hours). The size 
of the slyrmions is determined by the interplay be- 
tween Zeeman, magnetostatic interaction and 
interfacial DMI. In the presence of a constant 
electron current density of j^ = +5 x 10^ A/cm^, 
these skyrmions are created with a high speed 
close to the central constriction and destroyed at 
the end of the wire. Capturing the transformation 
dynamics of skyrmions from stripe domains is 
beyond the temporal resolution of the present set- 
up. Reproducible generation of slyrmions is dem- 
onstrated by repeating pulsed experiments several 
times {33). The left side of the device remains mainly 
in the labyrinthine stripe domain state after remov- 
ing the pulse current, which indicates that both 
slyrmion bubbles and stripe domains are metastable. 

When the polarity of the charge current is re- 
versed to Je = -5 X 10^ A/cm^, the skyrmions are 
formed at the left side of the device (Fig. 2, C and 
D). This directional dependence indicates that the 
spatially divergent current and SOT, determined 
by the geometry of the device, are most likely 
responsible for slicing stripe DWs into magnetic 



skyrmion bubbles, qualitatively consistent vrith the 
schematic presented in Fig. 1. 

At a negative magnetic field = -0.5 mT and 
current Je = +5 x 10^ A/cm^ (Fig. 2, E and F), a 
reversed contrast, resulting from opposite inner 
and outer magnetization orientations, is observed 
compared vrith positive fields. We varied the exter- 
nal magnetic field and chaige current density sys- 
tematically and determined the phase diagram for 
slyrmion formation shovm in Fig. 2G. A laige pop- 
ulation of synthetic slyrmions is found only in the 
shadowed region, whereas in the rest of phase 
diagram the initial domain configurations remain 
either stationary or flovring smoothly, depending 
on the strength of current density, as discussed 
below. This phase diagram is independent of pulse 
duration for pulses longer than 1 ps. No creation of 
skyrmions in a regular-shaped device vrith a homo- 
geneous current flow (as illustrated in Fig. IB) is ob- 
served up to a current density of^e = +5 x 10^ A/cm^. 

Capturing the transformational dynamics 

The conversion from chiral stripe domains into 
magnetic skyrmions can be captured by decreasing 
the driving current, which slows dovm the trans- 
formational dynamics. Figure 3, A to D, shows the 
dynamics for a constant current density of Je = 
+6.4 X 10^ A/cm^ at B^ = +0.46 mT. The origi- 
nal (disordered) labyrinthine domains on the left 
side squeeze to pass through the constriction (Fig. 




Fig. 2. Experimental generation of magnetic skyrmions. (A) Sparse irregular domain structures are 
observed at both sides of the device at a perpendicular magnetic field of = +0.5 mT. (B) Upon passing a 
current of /e = +5 x 10^ A/cm^ through the device, the left side of the device develops predominantly 
elongated stripe domains, whereas the right side converts into dense skyrmion bubbles. (C and D) By re- 
versing the current direction to /e = -5 x 10^ A/cm^, the dynamically created skyrmions are forming at the left 
side of the device. (E and F) Changing the polarity of the external magnetic field reverses the internal and 
external magnetization of these skyrmions. (G) Phase diagram for skyrmion formation. The shaded area in- 
dicates field-current combinations that result in the persistent generation of skyrmions after each current pulse. 
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3B). The stripe domains become unstable after 
passing through the constriction and are even- 
tually converted into skyrmions on the right side 
of the device, as shown in Fig. 3, C and D. This 
can be seen in more detail in the supplementaiy 
MOKE movies (movies SI and S2). Because the x 
component of the current results in an efficient 
motion of DWs, the skyrmion formation can 
occur away from the constriction. The synthetic 
skyrmions do not merge into stripe domains and, 
in fact, repel each other, indicating their topological 
protection as well as magnetostatic interactions. 

Some important features should be noticed. 
There exists a threshold current ^‘e-sk = ±6 x 
10^ A/cm^ for persistently generating skyrmion 
bubbles from stripe domains for pulses longer than 
1 ps. Above this current, the enhanced spin-orbit 
torques produce the instability of the DWs, which 
results in the continuous formation of skyrmions. 
The present geometry for skyrmion generation is 
very efficient, resulting in the observed threshold 
current that is three orders of magnitude smaller 
than suggested by previous simulation studies 



(10^ to 10^ A/cm^) in MnSi thin films with a bulk 
DMI where the driving mechanism is the conven- 
tional spin-transfer torque {30). Below this threshold 
for continuous skyrmion generation, there is a 
threshold depinning current je-st = ±4.1 x 10^ A/cm^ 
that produces a steady motion of stripe domains. 
The force (pressure) on the stripe from SOT at 
this current exceeds the one required to main- 
tain its shape. When je-st < ie < ie-s^ the stripe 
domains are moving smoothly through the con- 
striction and prevail at both sides of devices, 
with just the occasional formation of skyrmions. 

Depinning and motion of synthesized 
S = 1 skyrmions 

The magnetic skyrmion bubbles discussed so far 
have a topological charge given by the skyrmion 
number S' = 1, as is determined by wrapping the 
unit magnetization vector (m) over the sphere 
'S' = X dyV[v)dxdy (1, 8). These S = 

1 synthetic skyrmions move because of the op- 
posite direction of effective SOTs on the oppo- 
site sides of the skyrmion (Fig. 3E). Following the 



Fig. 3. Capturing the transfor- 
mational dynamics from stripe 
domains to skyrmions and 
motion of skyrmions. (A to D) 

At a constant current density je = 
+6.4 X 10^ A/cm^ and 
6^ = +0.46 mT, the disordered 
stripe domains are forced to 
pass through the constriction 
and are eventually converted 
into skyrmions at the right side 
of the device. Red circles 
highlight the resultant newly 
formed skyrmions. (E) Illustra- 
tion of the effective spin Hall 
field acting on these dynami- 
cally created skyrmions: the 
direction of motion follows the 
electron current. (F to I) Effi- 
cient motion of these skyrmions 
for a current density = +3 x 
10^ A/cm^. (F) First, a Ts-long 
single pulse je = +5 x 10^ A/cm^ 
initializes the skyrmion state. (G to 
I) Subsequently, smaller currents 
(below the threshold current to 
avoid generating additional skyr- 
mions through the constriction) are 
used to probe the current-velocity 
relation. These skyrmions are 
migrating stochastically and moving 
out of the field of view. See supple- 
mentary MOKE movies SI, S2, and 
S4 for the corresponding temporal 
dynamics. (J) The current-velocity 
dependence of skyrmions is ac- 
quired by studying ==20 skyrmions 
via averaging their velocities by 
dividing the total displacement 
with the total time period. 
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initialization by a current pulsey’e = +5 x lO^A/cm^ 
(which is larger than the threshold current y^sk for 
generating skyrmions), we studied the efficient 
depinning and motion of synthetic skyrmions 
(Fig. 3, F to I) at = -0.5 mT. At the current den- 
sity y‘e = +3 X 10^ A/cm^, there is no migration of 
stripe domains through the constriction (hence an 
absence of newly formed skyrmions). However, it 
is clear that the previously generated skyrmions 
at the right side of the device are gradually moving 
away following the electron flow direction. During 
the motion, no measurable distortion of these syn- 
thetic skyrmions is observed within the experi- 
mental resolution, consistent with the well-defined 
chirality of the skyrmion bubble. The average ve- 
locity p = i /At) is determined by dividing the dis- 
placement {£) with the total time period (Af). For 
the present current density, the motion of synthetic 
skyrmion is stochastic and influenced by random 
pinning with an average velocity of ~10 pm/s, the 
current dependence of which is summarized in 
Fig. 3J. The ratio of the velocity to the applied 
current is comparable to what is observed for the 
chiral DW motion in the related systems {17, 19). 

Current characteristics of S = 0 
magnetic bubbles 

Because of the competition between long-range 
dipolar and short-range exchange interaction, a 
system with a weak perpendicular magnetic an- 
isotropy undergoes a spin reorientation transition 
with in-plane magnetic fields that is typified by a 
stripe-to-bubble domain phase transition {39, 40). 
Such an in-plane field-induced bubble state is estab- 
lished by sweeping the magnetic field from | = 
+100 to +10 mT. Current-driven characteristics 
of the in-plane field-induced magnetic bubbles are 
in stark contrast to the mobile magnetic skyrmions 
generated from SOTs. These bubbles shrink and 
vanish in the presence of a positive electron current 
density (Fig. 4, A to E) or elongate and transform 
into stripe domains in the presence of negative elec- 
tron current density (Fig. 4, F to J). Such a distinct 
difference directly indicates the different spin struc- 
tures surrounding these field-induced bubbles and, 
therefore, the different skyrmion numbers. 

For the in-plane field-induced magnetic bub- 
bles, because the spin structures of DWs follow 
the external magnetic fields {18, 41, 42) (Fig. 4K), 
the corresponding skyrmion number is S' = 0. Be- 
cause of the same direction of the effective spin 
Hall fields given by the reversed DW orientations, 
topologically trivial S = 0 magnetic bubbles expe- 
rience opposite forces on the DWs at opposite ends. 
This leads to either a shrinking or elongation of the 
bubbles, depending on the direction of currents, 
which is consistent with our experimental obser- 
vation. This also explains the in-plane current- 
induced perpendicular magnetization switching 
in the presence of in-plane fields {27, 42). 

Perspectives 

Recent experimental efforts toward creating in- 
dividual magnetic skyrmions use either tunneling 
current from a low-temperature spin-polarized 
scanning tunneling microscope {43) or geometri- 
cal confinement via sophisticated nanopatteming 
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Fig. 4. Absence of motion for the in-plane magnetic fields stabilized S = 0 magnetic bubbles. (A to E) 

(A) In-plane magnetic field-induced bubbles are created by first saturating at in-plane field 6|| = +100 mT and 
subsequently decreasing to 6|| = +10 ml Depending on the direction of the current, these magnetic bubbles 
either shrink or expand. (A to E) Shrinking bubbles are observed upon increasing the current density from = -i-5 x 
10^ to -r2.5 X 10^ A/cm^ in steps of 5 x 10^ A/cm^. (F to J) Expansion of bubbles is revealed for currents from je = 
-0.5 X 10^ to -2.5 X 10^ A/cm^ in steps of 5 x 10^ A/cm^. (K) These results are linked to the different spin textures 
(namely, S = 0 skyrmion bubbles) that were stabilized along the DW by the in-plane magnetic fields, which lead to 
different orientations of the effective spin Hall fields and different directions of DW motion, as illustrated. 



(44-46). Our results demonstrate that spatially di- 
vergent current-induced SOTs can be an effective 
way for dynamically generating mobile magnetic 
skyrmions at room temperature in commonly ac- 
cessible material systems. The size of these syn- 
thetic skyrmions could be scaled down by properly 
engineering the material-specific parameters that 
control the various competing interactions in mag- 
netic nanostructures (23, 24, 47). We expect that 
similar instabilities will be generated from diver- 
gent chaige current flows. Whereas the mechanism 
for synthetic skyrmion generation can be qualita- 
tively linked to the spatially divergent spin HaH spin 
torque, a comprehensive understanding of this dy- 



namical conversion, particularly at the picosecond 
or nanosecond time scale where the intriguing mag- 
netization dynamics occurs, requires further exper- 
imental and theoretical investigations. Spatially 
divergent SOT-driven structures also offer a readily 
accessible model system for studying topological 
transitions and complex flow instabilities (35), vdrere 
the parameters governing the flow, such as surface 
tension, can be systematically tuned by the mag- 
netic interactions. At the same time, this dynamic 
approach for sl^rorion generation in the near future 
could enable the demonstration of advanced skyr- 
mionic device concepts; for example, functional 
skyrmion racetrack memory (14, 23, 36, 48). 
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HEAVY FERMIONS 

Unconventional Fermi surface in an 
insulating state 

B. S. Tan/ Y.-T. Hsu/ B. Zeng,^ M. Ciomaga Hatnean/ N. Harrison,^ Z. Zhu,^ 

M. Hartstein/ M. Kiourlappou/ A. Srivastava/ M. D. Johannes/ T. P. Murphy/ 

J.-H. Park/ L. Balicas/ G. G. Lonzarich/ G. Balakrishnan/ Suchitra E. Sebastian^* 

Insulators occur in more than one guise; a recent finding was a class of topological 
insulators, which host a conducting surface juxtaposed with an insulating bulk. Here, 
we report the observation of an unusual insulating state with an electrically insulating 
bulk that simultaneously yields bulk quantum oscillations with characteristics of an 
unconventional Fermi liquid. We present quantum oscillation measurements of magnetic 
torque in high-purity single crystals of the Kondo insulator SmBe, which reveal quantum 
oscillation frequencies characteristic of a large three-dimensional conduction electron 
Fermi surface similar to the metallic rare earth hexaborides such as PrBe and LaBe- The 
quantum oscillation amplitude strongly increases at low temperatures, appearing strikingly 
at variance with conventional metallic behavior. 



K ondo insulators, a class of materials posi- 
tioned close to the border between insu- 
lating and metallic behavior, provide fertile 
ground for unusual physics {1-14). This 
class of strongly correlated materials is 
thought to be characterized by a ground state 
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with a small energy gap at the Fermi energy 
owing to the collective hybridization of conduc- 
tion and /electrons. The observation of quantum 
oscillations has traditionally been associated with 
a Fermi liquid state; here, we present the sur- 
prising measurement of quantum oscillations in 
the Kondo insulator SmBg {IS) that originate from 
a large three-dimensional Fermi surface occupy- 
ing half the Brillouin zone and strongly resem- 
bling the conduction electron Fermi surface in 
the metallic rare earth hexaborides {16, 17). Our 
measurements in SmBg reveal a dramatic depar- 
ture from conventional metallic Lifshitz-Kosevich 
behavior {18); instead of the expected satura- 
tion at low temperatures, a striking increase is 



observed in the quantum oscillation amplitude 
at low temperatures. 

Single crystals of SmBg used in the present 
study were grown by means of the image furnace 
technique {19) in order to achieve high purities 
as characterized by the high inverse residual re- 
sistivity ratio. Single crystals with inverse resist- 
ance ratios [IRR = R{T = 1.8 K)/R{T = 300 K), where 
R is resistance and Tis temperature] of the order 
of 10^ were selected for this study; the IRR has 
been shown to characterize crystal quality, with 
the introduction of point defects through radia- 
tion damage {20) or through off-stoichiometry 
{21), resulting in a decrease in low-temperature 
resistance and an increase in high-temperature 
resistance. The resistance of a SmBg single crys- 
tal is shown in Fig. IB measured as a function of 
temperature at zero magnetic field and in an 
applied DC magnetic field of 45 T, demonstrating 
that activated electrical conductivity character- 
istic of an energy gap ~40 K at the Fermi energy 
persists up to high magnetic fields. The non- 
magnetic ground state of SmBg is evidenced by 
the linear magnetization up to 60 T (Fig. IB, bot- 
tom inset). 

We observed quantum oscillations in SmBg by 
measuring the magnetic torque. The measure- 
ments were done in magnetic fields up to 40 T 
and down toT= 0.4 K and in magnetic fields up 
to 35 T and down to T = 0.03 K Quantum os- 
cillations periodic in inverse magnetic field are 
observed against a quadratic background, with 
frequencies ranging from 50 to 15,000 T (Fig. 1, 
A, C, and D). A Fourier transform of the quantum 
oscillations is shown in Fig. 2A as a function of 
inverse magnetic field, revealing well-defined peaks 
corresponding to multiple frequencies. The peri- 
odicity of the quantum oscillations in inverse 
magnetic field is revealed by the linear Landau 
index plot in Fig. 2B. 

The observation, especially of rapid quantum 
oscillations with frequencies higher than 10 kT 







Fig. 1. Quantum oscillations in the magnetic torque in SmBe- (A) Quantum 
oscillations in magnetic torque are visible against a quadratic background. 
(Inset) Schematic of the magnetic torque measurement setup using a 
capacitive cantilever and the notation for angular rotation by angle 0. (B) Re- 
sistance as a function of temperature in zero magnetic field (blue line) and 
at 45 T (green line) using an unchanged measurement configuration on a 
SmBe sample of dimensions 1.1 by 0.3 by 0.1 mm. (Top inset) Measured 
resistance from 80 mK up to high temperatures, from which the high IRR 



can be ascertained [a fit to activated electrical conductivity is provided in 
(23)]. (Bottom inset) Magnetization of SmBe at 2.1 K remains linear up to 60 T. 
(C) Dominant low-frequency quantum oscillations can be discerned after 
background subtraction of a sixth-order polynomial. (D) Magnetic torque 
at the highest measured fields after the subtraction of the low-frequency 
background torque. Quantum oscillations are visible in an intermediate- 
frequency range (between 2000 and 4000 T) as well as a high-frequency 
range up to 15,000 T. 
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Fig. 2. Landau quantization in SmBe- (A) Fourier transforms of magnetic torque as a function of 
inverse magnetic field, from which a polynomial background has been subtracted, revealing multiple 
quantum oscillatory frequencies ranging from 50 T to 15,000 T. Field ranges for analysis have been 
chosen that best capture the observed oscillations, with the highest frequencies only appearing in 
the higher field ranges. (B) The maxima and minima in the derivative of magnetic torque with 
respect to the magnetic field, corresponding to the dominant low-frequency oscillation, are plotted 
as a function of inverse magnetic field: the linear dependence signals Landau quantization. 



(corresponding to approximately half the volume 
of the cubic Brillouin zone) in SmBg, is striking. 
This observation is in contrast to previous re- 
ports of veiy-low-frequency quantum oscillations 
corresponding to a few percent of the Brillouin 
zone in SmBg, attributed to a two-dimensional 
surface contribution (22). Our observation of very 
high quantum oscillation frequencies requiring 
mean free paths on the order of a few micrometers 
would be challenging to explain from a surface 
layer of a few atomic lengths’ thickness, which 
would typically yield such rapid frequencies only 
at a special angle of inclination at which the cy- 
clotron orbit lies completely within the surface 
layer. Key to identifying the Fermi surface from 
vdiich the observed quantum oscfllation frequencies 
originate is a comparison with previous quantum 
oscillation measurements on metallic hexaborides, 
such as nonmagnetic LaBg, antiferromagnetic 
CeBg, and antiferromagnetic PrBg (16, IT). These 
materials exhibit a metallic ground state involving 
predominantly conduction electrons, with a low 
residual resistivity of the order of one microhm cm 
(~10® times lower than in Kondo insulating SmBg) 
and are characterized by a multiply connected 
Fermi surface of prolate spheroids (Fig. 3, D and 
E). Strikingly, the angular dependence of the var- 
ious quantum oscillation frequencies in SmBg 
reveals characteristic signatures of the three- 
dimensional Fermi surface identified in the me- 
tallic rare earth hexaborides (Fig. 3, A to C). In 
particular, the high observed a frequencies (Fig. 3A) 
reveal the characteristic symmetry of large pro- 
late spheroids centered at X-points of the Brillouin 
zone (Fig. 3, D and E), whereas the lower observed 
frequencies (Fig. 3A) reveal the characteristic 
symmetry of small ellipsoids located at the neck 
positions. Both of these types of ellipsoids are 
universal Fermi surface features identified from 
experiment and band structure calculations in 
the metallic rare earth hexaborides (16, 17). Sim- 
ilar features are also revealed in density functional 
calculations of SmBg when the Fermi energy is 
shifted from its calculated position in the insu- 
lating gap either up into the conduction or down 
into the valence bands (Fig. 3, D and E) (23). 

The observed angular dependence of quantum 
oscillations in SmBe remains the same irrespec- 
tive of whether the sample is prepared as a thin 
plate with a large plane face perpendicular to the 
[110] direction, or to the [100] direction (fig. SI), 
and exhibits the same characteristic signatures 
with respect to the orientation of the magnetic 
field to the crystallographic symmetry axes of 
the bulk crystal (Fig. 3A). The bulk quantum 
oscillations we measure in SmBg corresponding 
to the three-dimensional Fermi surface mapped 
out in the metallic rare earth hexaborides may 
not be directly related to the potential topolog- 
ical character of SmBg, which would have as its 
signature a conducting surface (24). In addition 
to the magnetic torque signal from the atom- 
ically thin surface region being several orders 
of magnitude smaller than the signal from the 
bulk, the observation of surface quantum oscil- 
lations would be rendered more challenging by 
the reported Sm depletion and resulting recon- 



struction of Sm ions at the surface layer of 
SmBg (25). 

The unconventional character of the state we 
measure in SmBg becomes apparent upon inves- 
tigating the temperature dependence of the quan- 
tum oscillation amplitude in SmBg. We found 
that between r = 25 K and 2 K, the quantum os- 
cillation amplitude exhibits a Lifshitz-Kosevich- 
like temperature dependence (Fig. 4) characteristic 
of a low effective mass similar to that of metallic 
LaBe, which has only conduction electrons (16). 
The comparable size of low-temperature electronic 
heat capacity measured for our SmBg single crys- 
tals to that of metallic LaBg (23) also seems to 
suggest a large Fermi surface with low effective 
mass in SmBg. However, instead of saturating at 
lower temperatures as would be expected for the 
Lifshitz-Kosevich distribution characteristic of 
quasiparticles with Fermi-Dirac statistics (18), the 
quantum oscillation amplitude increases dramat- 
ically as low temperatures down to 30 mK are 
approached (Fig. 4). Such non-Lifshitz-Kosevich 
temperature dependence is remarkable, given the 
robust adherence to Lifshitz-Kosevich tempera- 
ture dependence in most examples of strongly 
correlated electron systems, from the underdoped 
cuprate superconductors (26) to heavy fermion 
systems (27, 28) to systems displaying signatures 



of quantum criticality (29), a notable exception 
being fractional quantum Hall systems (30, 31). 
The possibility of a subtle departure from Lifshitz- 
Kosevich temperature dependence has been re- 
ported in a few materials (32, 33). 

The ground state of SmBg is fairly insensitive 
to applied magnetic fields, with activated electri- 
cal conductivity behavior across a gap remaining 
largely unchanged up to at least 45 T (Fig. IB). 
Such a weak coupling to the magnetic field is in 
contrast to unconventional states in other mate- 
rials that are tuned by an applied magnetic field 
(6-11, 13). Furthermore, this rules out the pos- 
sibility of quantum oscillations in SmBg arising 
from a high magnetic field state in which the 
energy gap is closed. The possibility that quan- 
tum oscillations arise from static, spatially dis- 
connected metallic patches of at least 1-pm length 
scale that do not contribute to the electrical trans- 
port also appears unlikely. Similar quantum os- 
cillations are observed in all (more than 10) 
measured high-quality samples in multiple high- 
magnetic-field experiments, with the best sam- 
ples yielding magnetic quantum oscillations of 
amplitude corresponding to a substantial frac- 
tion of the expected size from bulk SmBg. The 
presence of rare earths other than Sm has been 
ruled out to within 0.01% by means of chemical 



288 17 JULY 2015 • VOL 349 ISSUE 6245 



sciencemag.org SCIENCE 



RESEARCH \ REPORTS 




Fig. 3. Angular dependence of the quantum oscillation frequencies in 
SmBe. (A) Data from two of the SmBe samples in which oscillations were 
observed are shown, indicated by solid and open circles. One of the samples 
(solid circles) was prepared as a thin plate with the dominant face per- 
pendicular to the [100] axis [sample 1 (23)], and the second sample (open 
circles) was prepared as a thin plate with the dominant face perpendicular to 
the [110] axis [sample 2 (23)]. (B and C) The angular dependence strongly 
resembles that of the three-dimensional Fermi surface in antiferromagnetic 
PrBe shown in (B), and nonmagnetic LaBe shown in (C) (17). (D and E) The a 
orbit in red in all the rare earth hexaborides is fit to large multiply connected 



prolate spheroids centered at the X points of the Brillouin zone, shown in (D); a 
cross section in the XM plane is shown in (E). The p and p' orbits in each of the 
rare earth hexaborides are fit to small ellipsoids located at the neck positions 
[not shown in (D) and (E)]. More details of the fits are provided in (23). The 
remaining intermediate orbits are shown with lines as a guide to the eye. All 
orbit identifications have been made after measured frequencies and band 
structure calculations in PrBe and LaBe (17). (D) and (E) show Fermi surfaces 
calculated for SmBe using density functional theory (23), with a downward shift 
of the Fermi energy from its calculated position within the gap to expose the 
unhybridized bands and yield pocket sizes similar to experiment. 



Fig. 4. Temperature dependence 
of quantum oscillation amplitude. 

The dominant 330 T frequency over 
the magnetic field range 25 to 35 T 
is shown, revealing a steep increase 
in amplitude at low temperatures. 

The measurements in the temper- 
ature range from 25 K down to 
0.35 K were performed in a 
^He fridge in the hybrid magnet 
[sample 1 (23), blue diamonds], 
whereas the measurements at 
temperatures in the range from 1 K 
down to 30 mK were done in a 
dilution fridge in the resistive 
magnet on two different samples 
[sample 1, purple diamonds: sample 
3 (23), blue squares]. At low 
temperatures, a strong deviation 
from the conventional Lifshitz-Kosevich 
form can be seen in the inset by 
comparison with a simulated T(K) 

Lifshitz-Kosevich form for effective 

mass m* = O.lSmg. A logarithmic temperature scale is used in the inset for clarity. 




analysis and scanning electron microscopy (23). 
Off-stoichiometric metallic regions of SmBg ap- 
pear an unlikely explanation for our results, 
given reports that up to 30% Sm depletion does 
not close the energy gap (20), whereas scanning 
electron microscopy of our samples reveals a 
homogeneity of within 1% of Sm concentration 



over the sample area (23). The possibility of spa- 
tially disconnected strained regions of SmBg, 
which is known to become metallic under ap- 
plied pressures on the order of 10 GPa, or static 
spatially disconnected islands of hybridized and 
unhybridized Sm /-electrons also seems unlike- 
ly. An improvement in the IRR by means of the 



removal of strain with electropolishing strength- 
ens the quantum oscillation signal, whereas 
straining the sample by means of thermal cy- 
cling weakens the quantum oscillation signal 
(23). Further, the interplay between hybridized 
and unhybridized Sm /electrons, which may be 
an important ingredient in the physics of SmBg, 
has been revealed by Mossbauer and muon-spin 
relaxation experiments to be homogeneous and 
dynamically fluctuating, rather than being man- 
ifested as static spatially inhomogeneous re- 
gions (34, 3S). 

The insulating state in SmBg in which low- 
energy excitations lack long-range charge trans- 
port, as shown by the activated dc electrical 
conductivity, but display extended character, as 
shown by quantum oscillations, poses a mystery. 
A clue might be provided by slow fluctuations be- 
tween a collectively hybridized insulating state 
and an unhybridized state in which the conduc- 
tion electrons form a solely conduction electron 
Fermi surface, similar to that we observe (3, 35-39). 
A fluctuation time scale in the range between 
10"^ and 10"^^ seconds is suggested by previous 
x-ray absorption spectroscopy and Mossbauer 
measurements (40). This time scale is longer or 
comparable with the inverse cyclotron frequency 
(1/cOc), which is on the order of 10"^^ seconds for 
the measured cyclotron orbits. Intriguingly, sim- 
ilar slow fluctuations have been invoked to explain 
quantum critical signatures in the metallic /electron 
system p-YbAlB 4 (41). SmBg may be viewed as 
being on the border of quantum criticality in the 
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sense that it transforms from a nonmagnetic 
insulating phase to a magnetic metallic phase 
under applied pressures on the order of 10 GPa 
(42-45), which is in contrast to other metallic 
rare earth hexaborides in which the /-electrons 
order magnetically in the ambient ground state. 
Our observation of a large three-dimensional 
conduction electron Fermi surface revealed by 
quantum oscillations may be related to reports 
of a residual density of states at the Fermi energy 
in SmBg through measurements of heat capacity 
(23, 46), optical conductivity (47), Raman scat- 
tering (48), and neutron scattering (49). Another 
possibility is that quantum oscillations could 
arise even in a system with a gap in the excitation 
spectrum at the Fermi energy, provided that the 
size of the gap is not much larger than the cy- 
clotron energy (50). Within this scenario, the re- 
sidual density of states observed at the Fermi 
energy with complementary measurements and 
the steep upturn in quantum oscillation ampli- 
tude we observe at low temperatures appear chal- 
lenging to explain. 
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NANOPARTICLE IMAGING 



C olloidal nanoparticles are clusters of hun- 
dreds to thousands of inorganic atoms typ- 
ically surrounded by organic ligands that 
stabilize them in solution. The atomic ar- 
rangement of colloidal nanoparticles deter- 
mines their chemical and physical properties, which 
are distinct from bulk materials and can be ex- 
ploited for many applications in biological imag- 
ing, renewable energy, catalysis, and more. The 3D 
atomic arrangement on the surface and in the core 
of a nanociystal influences the electronic struc- 
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ture, which affects how the nanocrystal functions 
in catalysis or how it interacts with other com- 
ponents at the atomic scale (1). Introduction of 
atomic dopants, surface adatoms, defects, and grain 
boundaries alters the chemical properties of nano- 
ciystals (2). Ensembles of synthesized nanociystals 
in solution are structurally inhomogeneous because 
of the stochastic nature of nanocrystal nucleation 
and growth (3, 4). Therefore, a method for deter- 
mination of the 3D atomic arrangement of indi- 
vidual unique nanoparticles in solution is needed. 



3D structure of individual 
nanocrystals in solution 
by electron microscopy 

Jungwon Park/’^’^* Hans Elmlund,^’^* Peter Ercius,®* Jong Min Yuk/’®’® 

David T. Limmer/*^ Qian Chen/’®’^^ Kwanpyo Kim/^ Sang Hoon Han/^ 

David A. Weitz,^’^ A. Zettl/’®’® A. Paul Alivisatos^’®’®! 

Knowledge about the synthesis, growth mechanisms, and physical properties of colloidal 
nanoparticles has been limited by technical impediments. We introduce a method for 
determining three-dimensional (3D) structures of individual nanoparticles in solution. We 
combine a graphene liquid cell, high-resolution transmission electron microscopy, a direct 
electron detector, and an algorithm for single-particle 3D reconstruction originally 
developed for analysis of biological molecules. This method yielded two 3D structures of 
individual platinum nanocrystals at near-atomic resolution. Because our method derives 
the 3D structure from images of individual nanoparticles rotating freely in solution, it 
enables the analysis of heterogeneous populations of potentially unordered nanoparticles 
that are synthesized in solution, thereby providing a means to understand the structure 
and stability of defects at the nanoscale. 
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Electron tomography is routinely used for 3D 
analysis of materials (5-9). This method cannot 
be applied to individual particles in a liquid be- 
cause it relies on acquisition of images of a single 
object at many different tilt angles over a period 
of 2 to 5 hours, assuming the object is static dur- 
ing the entire acquisition. Single-particle cryo- 
electron microscopy (cryo-EM) is a common 
method for the determination of 3D structures 
in biological sciences. The average 3D Coulomb 
potential map (i.e., density) of a protein is recon- 
structed from tens of thousands of transmission 
electron microscopy (TEM) images of randomly 
oriented copies of the same protein embedded in 
vitreous ice (10). The unknovm 3D projection an- 
gles of the images are determined by computa- 
tional methods (11). Single-particle cryo-EM has 
succeeded in reconstructing biological molecules 
with nearly 3 A resolution (10, 12). A similar ap- 
proach was recently applied to reconstruct the 
atomic structure of homogeneous ultrasmall gold 
clusters (13). However, the single-particle method 
is not readily applicable to 3D reconstruction of 
colloidal nanoparticles because of their intrinsic 
structural inhomogeneity at the atomic level. 

TEM has undergone technical improvements 
in the past decades (5, 14-17). The image reso- 
lution has been improved with the introduction 
of electron lens aberration correctors (15). The 
development of direct electron detectors has led 
to improvements in image quality and rapid 
acquisition of movies that allow for compensa- 
tion for beam-induced specimen motion, thereby 
providing a substantial enhancement to single- 
particle cryo-EM (18). 

We present a hybrid method for reconstruct- 
ing the 3D structures of individual nanoparticles 
in solution. The method represents a combina- 
tion of three technological advancements from 
TEM imaging in biological and materials sci- 
ence: (i) the development of graphene as a cover- 
ing to hold a liquid in vacuum (the so-called 
graphene liquid cell, or GLC) that allows atomic- 
resolution imaging of nanoparticles that move 
and rotate freely in solution by aberration-corrected 
TEM (3, 19)\ (ii) the advent of direct electron 
detectors, producing movies with millisecond 
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frame-to-frame time resolution of the rotating 
nanocrystals (18); and (hi) a theory for ab initio 
single-particle 3D reconstruction, used to solve 
the inverse problem of recovering the unknown 
3D orientations of the individual noisy nano- 
ciystal projections (11). The resulting hybrid tech- 
nique, 3D structure identification of nanoparticles 
by GLC EM (abbreviated as SINGLE), was used 
to separately reconstruct the 3D structures of two 
individual Pt nanocrystals in solution. 

Pt nanoparticles were chosen because of their 
high electron scattering strength, because their 
detailed atomic structure is important for catal- 
ysis, and because earlier graphene liquid cell 
studies have shovm that they grow by nano- 
particle aggregation, resulting in complex struc- 
tures that are not possible to determine by any 
previously developed method. Pt nanocrystals 
with sub-2 nm diameter were prepared in solu- 
tion. Two graphene sheets were grown by the 
chemical vapor deposition method and used to 
entrap solvated nanocrystals (3). The graphene 
provides an ultrathin covering of material to 
maintain liquid conditions in the TEM vacuum 
and presents an inert surface onto which the 
nanoparticles do not adsorb. The translational 
and rotational motions of the particles in liq- 
uid pockets with sub-50 nm diameter were 
imaged in situ using TEAM I, a TEM instrument 



with geometrical and chromatic aberration cor- 
rection, operated at 300 kV using a direct elec- 
tron detector (Fig. lA). The corrector was set to 
apply a slightly negative spherical aberration coef- 
ficient of about -10 pm. Under these imaging con- 
ditions, we produced white atom contrast when 
using a small underfocus value of 30 to 50 A. 

Movies of the moving particles had a temporal 
resolution of 50 frames/s, a field of view of 1024 x 
1024 pixels, and a Nyquist sampling limit of 0.56 A 
Each movie frame represents a 2D projection of 
many particles in random orientations. To re- 
construct the individual 3D structures, we chose 
a small region around each single particle of in- 
terest in each individual frame to create a set of 
projections per particle. The 3D orientations of 
the resulting series of noisy 2D images of a 
single rotating particle were recovered compu- 
tationally using an ab initio 3D reconstruction 
algorithm adapted from one originally developed 
to recover orientations from cryo-EM images of 
many identical individual particles (11). 

Although the TEM movies contain many par- 
ticles, not all particles could be used for recon- 
struction because of overlap vyith other particles 
and insufficient rotation. Here, we present the 
two most reliable 3D reconstructions from a 
1561-image series (particle 1 in Fig. 1, B and C, 
and movie SI) and a 1171-image series (particle 2 



Fig. 1. Schematic illus- 
tration of in situ TEM 
imaging of Pt nanoctystals 
freely rotating in a 
graphene liquid cell 
(GLC) and 3D EM 
density maps calculated 
from individual Pt nano- 
particles in solution. 

(A) A movie of the single 
rotating Pt nanocrystal pro- 
vides 2D projected TEM still 
snapshots in many orienta- 
tions for ab initio particle 
reconstruction. (B) EM den- 
sity map obtained from the 
3D reconstruction of parti- 
cle 1. The orientation of the 
particle is aligned to expose 
{111} planes of the core 
domain. Three distinct 
crystal domains can be 
identified. (C) 3D EM den- 
sity map of particle 1 with 
alternative viewing angles. 

(D) EM density map 
obtained from the 3D 
reconstruction of particle 2. 

(E) 3D EM density map of 
particle 2 with alternative 
viewing angles. Each panel in 

(C) and (E) presents the two 
particles with the same angle 
and direction with respect to 
the orientations in (B) and 

(D) . Scale bars, 0.5 nm. 
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in Fig. 1, D and E, and movie S2). Shovm in Fig. 1, 
B to E, are the EM density maps of the two Pt 
nanocrystals. The rendered particle volumes are 
5300 (diameter 22 A) for particle 1 and 4800 A^ 

(diameter 20 A) for particle 2. Along with the 
direct visualization of the spatial distribution of 



Pt atomic planes, external and internal struc- 
tures of the particle are uncovered. Each recon- 
struction shows three distinct crystal domains in 
both of the Pt particles. In Fig. 1, B and D, we show 
views of the EM maps in an orientation that re- 
veals distinct lattice planes of the core domain. 



Differently oriented 3D density maps are shovm 
in Fig. 1, C and E, and movies SI and S2. 

Our 3D reconstruction methodology produced 
reconstructions at near-atomic resolution from 
relatively small sets of noisy experimental TEM 
images of nanocrystals in random orientations. 




particle 1 particle 2 




Fig. 2. The underlying structural principle of the small Pt nanoparticles 
studied here. (A) Schematic illustration of the front view of particles 1 and 
2 shown in Fig. 1. Both particles are composed of a dense central disc of 
atomic planes (blue lines) with conical protrusions (orange lines) anchoring 
on each side of the disc. (B) Cross-sectional view of the EM density map of 
particle 1 and 2 along the vertical plane. (C) View orthogonal to (A), showing 
the overlaid lattices. Scale bars, 0.5 nm. (D) Schematic illustration of the top 
view of particles 1 and 2 shown in (C). (E) Cross-sectional view of the EM 
density map of particles 1 and 2 along the equatorial plane. 




Fig. 3. Cross-sectional study of particle 1. (A) 3D 

density map of particle 1 with color coding to highlight 
the three sections. Cross sections 1 and 2 are in arbi- 
trary positions near crystal domain interfaces. Scale bar, 
0.5 nm. (B) Slab through the 3D reconstruction of par- 
ticle 1 along the vertical plane, with tentative atomic po- 
sitions indicated. ABC repeats of {111} planes are visible. 
(C) Slab along cross section 1. The exposed (100) sur- 
face of the fee Pt crystal is shown consistently in both 
exposed surfaces. Intersections with {111} planes in the 
top domain are displayed by red dashed lines. (D) Slab 
along the equatorial plane of (A) exposes a (110) plane. 
The intersections with {111} planes are shown by blue 
dashed lines. Pseudo-atomic structure (right) demon- 
strates the rotation angle (14°) between the (100) and 
(110) surfaces from the top and bottom domains, re- 
spectively. (E) Slab along cross section 2. The disordered 
(100) surface of the fee Pt crystal is shown consistently 
in both exposed surfaces. Intersections with pseudo 
{111} planes in the bottom domain are displayed by red 
dashed lines. 
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We tested the validity of SINGLE using 1000 
multislice TEM simulations of a randomly ori- 
ented Pt nanocrystal with dodecahedral sym- 
metry and corresponding size. Reconstructions 
were obtained from images with a signal-to-noise 
ratio (SNR) roughly corresponding to that of the 
experimental images (see fig. SI). This verified 
that the experimental images can conform to the 
projection-slice theorem with ideal microscope 
conditions ( 20 ). 

To confirm the existence of a projected lattice 
in the experimental images, we calculated their 
power spectra, which showed distinct spots along 
different crystalline zone axes (fig. S2). Closer ex- 
amination of the power spectra and correspond- 
ing orientation coverage throughout the image 
series revealed that each particle continuously 
undergoes small local rotations followed by rapid 
orientation changes, often accompanied by later- 
al movement. The noise variance in the individ- 
ual frames of the movie exceeds the signal variance 
by approximately a factor of 5, despite the strong 
scattering from the heavy Pt atoms (movies S3 
and S4 show 200 raw TEM images). Although 
sufficiently thin to observe atomic detail, the liq- 
uid between the graphene sheets introduces a 



granular background, making it difficult to dis- 
tinguish the facets of the nanocrystal in the indi- 
vidual frames. To enhance the image contrast 
and allow accurate 3D orientation determina- 
tion, we averaged the image series in sets of five 
consecutive frames, resulting in a time resolution 
of -100 ms. After frame averaging, we carefully 
scanned the image series to remove a small frac- 
tion (-20%) of those averages that did not show 
any lattice contrast because the particles had 
moved out of the narrow ideal focal plane of the 
aberration-corrected microscope. 

Initial 3D models were obtained using our re- 
cently developed framework (PRIME) for ah initio 
single-particle 3D reconstruction ( 11 ). The stan- 
dard technique used in biological single-particle 
cryo-EM ( 10 ) assigns each image the single best 
matching orientation, as determined by correla- 
tion matching to a gallery of reference images, 
obtained by projecting an a priori available 3D 
reference model. Two fundamental limitations 
of the standard technique are the bias introduced 
by the initial 3D model and the lack of mecha- 
nisms for modeling the alignment errors when 
data are noisy and the model is of poor quality. 
These limitations may be quite substantial when 



reconstructing nanocrystals, because every parti- 
cle is different and the particle population cannot 
be averaged, as in traditional single-particle EM. 
The individual frames also have a low SNR be- 
cause of the relatively low per-frame electron dose 
and because of the granular background intro- 
duced by the liquid. PRIME overcomes these 
drawbacks by using weighted orientation as- 
signment and stochastic optimization for de- 
termination of an optimal orientation weight 
distribution without any a priori information 
about the nanoparticles. 

To initialize the 3D reconstruction process, we 
assigned the random orientations to images, pro- 
ducing a featureless spherical density map. The 
random orientations were refined by stochastic 
optimization of the correlation between the im- 
ages and reprojections of the density map, using 
information from 30 A to 3 A and a discrete 
search space of orientations. The resulting initial 
model had a resolution of 2.5 A according to the 
0.143 Fourier shell correlation (ESC) criterion 
( 21 ). We extended the PRIME algorithm by in- 
troducing a continuous orientation search space 
and used stochastic optimization to determine a 
continuous distribution of weights that related 
the continuous distribution of orientation param- 
eters to the 3D reconstruction. Each round of 
the PRIME iterative alignment procedure in- 
volved determination of orientation weights for 
all particle images, followed by a weighted 3D 
reconstruction by direct Fourier reconstruction 
using a Kaiser-Bessel interpolation kernel. A few 
hundred iterations were executed, and in every 
round the ESC was calculated and used to con- 
struct a 3D Wiener filter that filtered the map 
such that the optimal SNR was obtained at the 
present resolution ( 22 ). The resolution of the final 
refined maps was measured to 2.10 A (particle 1) 
and 2.14 A (particle 2), respectively. 

Even though the ESC methodology provides 
an accurate measure of the resolution when the 
reference structure used for matching has been 
appropriately low-pass filtered, it does not en- 
sure against grossly incorrect structures. To vali- 
date our structures, we determined the agreement 
between the individual images used for recon- 
struction and the corresponding reprojections of 
the reconstructed 3D map. All image reprojec- 
tion pairs (fig. S3) were generated for the two 
reconstructions, and the Fourier ring correlation 
(FRC) ( 23 ) was calculated between all pairs. The 
average FRC was larger than 0.143 to a resolution 
of 1 A and showed a distinct peak spanning the 
1 to 2 A resolution region (fig. S3). This peak is 
due to the correlation between atomic densities 
in the reprojections and atomic densities in the 
images. We concluded that our reconstructions 
showed excellent agreement with the images. 
The spatial resolution is higher for particle 1, 
which has a larger number of frames, indicating 
that the present resolution can be improved by 
acquisition of longer movies that cover a wider 
range of rotational orientations. 

We had anticipated that the Pt nanocrystals 
would have at least twofold rotational symmetry, 
perhaps even cubic symmetry. Remarkably, the 
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Fig. 4. Twist grain boundary free energies as a function of misalignment angle for the (IIO)-(IOO) 
crystal planes of a nanocrystal with 1135 atoms. Shaded regions indicate error bars of 1 and 2 

standard deviations. The inset shows a relaxed nanocrystal with an initial misalignment angle of 14° The 
non-flat grain boundary is located in the center of the nanocrystal. Images around the perimeter show the 
two relaxed planes at the grain boundary from the 100 (red) and 110 (white) grains for different 
misalignment angles. 
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reconstructions appeared asymmetrical, and at- 
tempts to apply C2, C4, C5, and D2 point-group 
symmetry by aligning the images to the principal 
symmetry axis and doing symmetric orientation 
refinement failed to improve the correlation be- 
tween the reprojections and the images. This 
suggests that the Pt nanocrystals do not possess 
icosahedral, cubic, or pentagonal symmetry but 
are intrinsically asymmetrical. Reconstruction 
by the traditional single-particle cryo-EM techni- 
que, using an initial model based on a faceted 
face-centered cubic (fee) nanoparticle, would suf- 
fer severe initial model bias and would not repro- 
duce the true asymmetrical multidomain structure. 

Our reconstructions of particles 1 and 2 re- 
vealed different asymmetrical crystal structures 
with the same underlying principle: a dense cen- 
tral disc of atomic planes (the core) with conical 
extensions anchored on each side, protruding in 
opposing and orthogonal directions with respect 
to the equatorial plane (see schematics in Fig. 2A 
with the lattice in different crystal domains col- 
ored blue and orange). Despite this similarity, 
the reconstructed 3D structures of the two Pt 
particles show interesting differences. First, the 
atomic arrangements on the surfaces observed in 
the EM density maps of particle 1 and 2 are dis- 
tinct (Fig. 1C and Fig. 2E). Second, the two 
particles show different degrees of crystallinity 
in each domain. Whereas straight {111} crystal 
planes in three domains of particle 1 are shown 
throughout the cross-sectional images along the 
mid-vertical plane (Fig. 2B) and vertical planes at 
different depths (fig. S4), particle 2 shows more 
disordered internal structures within the do- 
mains. Also in particle 1, the conical protru- 
sions have {111} atomic planes tilted with respect 
to the {111} atomic planes in the core. Particle 2 
exhibits protrusions with a larger degree of dis- 
order relative to the well-aligned {111} lattices 
in the particle 1 protrusions. Views of the recon- 
structions perpendicular to the equatorial plane 
(Fig. 2C) and a schematic illustration (Fig. 2D) 
show the overlaid lattices of the core (blue lines) 
and protrusions (yellow lines) with different tilt- 
ing angles for particle 1 (14°) and particle 2 (7°). 

The cross-sectional views along the equatorial 
plane (Fig. 2E) for particles 1 and 2 show similar 
arrangements of the {111} lattice planes, but cross- 
sectional images along the horizontal planes be- 
low and above the equatorial plane for the two 
particles (particle 1 in Fig. 3, C to E; particle 2 in 
fig. S5) indicate that the two particles are as- 
sembled by multiple domains but in unique geom- 
etries. The multiple domains and twisted grain 
boundaries that are present in the reconstruc- 
tions are similar to previous observations of Pt 
nanocrystal growth trajectories, where small par- 
ticles were observed to join along surfaces with 
low ligand coverage (3). The multidomain arrange- 
ment is also supported by other tomography 
reconstructions of larger Pt nanocrystals contain- 
ing multiply twinned domains with decahedral 
symmetry and central screw dislocations (3, 5). 
Multidomain structures are common in many 
other colloidal metal nanoparticles, which likely 
evolve as a result of multiple coalescence events 



during growth (3, 24-26). Our observation of het- 
erogeneously structured Pt nanoparticles formed 
in the same solution confirms that individual 
particles from the same synthesis follow different 
nucleation and growth trajectories. 

Figure 3A presents a 3D density map with col- 
ored sections indicating the upper (blue), core 
(gray), and lower (purple) domains of particle 1. 
Figure 3B shows a cross section perpendicular to 
the equatorial plane with the tentative atomic 
positions (different colored dots in each domain) 
indicating {111} lattice planes of fee crystal that 
have repetition in every three crystal planes in all 
three crystal domains. Atoms closer to the sur- 
face seem to deviate farther from a perfect fee 
structure. This is presumably explained by the 
fact that surface atoms are prone to relax excess 
free energy because of insufficient coordination 
and stabilization by surface ligand binding. In 
addition, the interface between the domains is 
disordered and not fiat— a general consequence 
of the reduction in surface energies expected for 
nanoscale crystals relative to their bulk counter- 
parts. The locations of three cross sections along 
horizontal planes (cross section 1, equator, and 
cross section 2) are indicated in Fig. 3A. Cross 
sections 1 and 2 are positioned near the interface 
between the central disk and the upper and low- 
er conical protrusions, respectively. Cross section 
1 exposes two facing surfaces from the upper and 
core domains (blue and gray densities in Fig. 3C, 
respectively). Cross section 1 shows a (100) surface 
with red dashed lines that trace the {111} planes 
exposed on the (100) surface. The cross section at 
the equator (Fig. 3D) exhibits a (110) surface, and 
blue dashed lines indicate the {111} planes ex- 
posed to the (110) surface. The red and blue lines 
are mapped onto the pseudo-atomic illustra- 
tion of (100) and (110) surfaces with the proper 
orientation (a 14° rotation angle) in the right 
image of Fig. 3D. At cross section 2, near the 
interface between the central disk and the lower 
conical protrusion, a surface structure with pseu- 
do (100) patterns traced by red dotted lines is 
exposed from the core and lower domains (gray 
and purple densities in Fig. 3E, respectively), 
which deviate from the ideal fee structure. Figure 
3 shows that the conical protrusions and the core 
join along (100) and (110) surfaces with distor- 
tions at the interface. Multiple domains merging 
along the low-index crystal planes, such as (100) 
and (110), are presumably formed by coalescence 
events between small particles during the particle 
growth as a route to minimize excess surface en- 
ergy. We previously observed a similar scenario: 
Two small Pt particles join along the (111) surfaces 
during growth (3). Presumably, coalescence along 
the low-index surfaces and the consequent evolu- 
tion of the interface structure are mechanisms by 
which the nanociystals reduce free energy. 

To examine whether there is a thermodynamic 
rationale for the multidomain structures that we 
observed, we computed the free energies for the 
grain boundary formation and for the ligand- 
exposed surfaces. This was done using the Frenkel- 
Ladd method (27) for an embedded atom model 
of Pt (28), using an appropriate thermodynamics 



integration path (fig. S6). We found that for low- 
angle misalignments, like those observed in par- 
ticle 1 [14° rotation angle between {111} planes 
exposed on (110) and (100) surfaces from the cen- 
tral disc and upper domain in Fig. 3], the grain 
boundary free energy for a nanoparticle with 
1200 atoms is 0.3 J/m^, increasing to 0.55 J/m^ 
for misalignment angles of up to 59° (Fig. 4). 
Small rotational relaxation of the crystal grains is 
found to be energetically viable at these inter- 
faces, which may explain the orientation of the 
side protrusions in particle 1 with respect to its 
central section. 

The surface free energy for the ligand-exposed 
interfaces was computed to be 2.8 J/m^ as aver- 
aged over the (100) and (111) surfaces, weighted 
appropriately for the cuboctahedral shape. The 
disparity of scales between the grain boundary 
free energies and the much larger exposed sur- 
face free energies confirms that there is a large 
thermodynamic driving force for coalescence, 
even when such events result in grain boundary 
formation. As the free energy gain upon coalescence 
is much larger than thermal energies, initial aggre- 
gations are likely irreversible. The resultant grain 
boundaries are then kinetically arrested over lab- 
oratory time scales (see supplementary materials). 

Our results show that the SINGLE methodol- 
ogy can be used to investigate the structural 
principles underlying the assembly and transient 
morphology of any stable, small nanoparticle in 
solution. We envision that SINGLE can be ap- 
plied directly to in situ 3D structural studies of 
many other kinds of solvated particles. 
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ANIMAL PHYSIOLOGY 

Summer declines in activity and body 
temperature offer polar bears 
limited energy savings 

J. P. Whiteman, H. J. Harlow,^ G. M. Dumer,^ R. Anderson-Sprecher,^ 

S. E. Albeke,^ E. V. Regehr,® S. C. Amstnip,’^ M. Ben-David^’^ 

Polar bears (Ursus maritimus) summer on the sea ice or, where it melts, on shore. Although 
the physiology of “ice” bears in summer is unknown, “shore” bears purportedly 
minimize energy losses by entering a hibernation-like state when deprived of food. Such a 
strategy could partially compensate for the loss of on-ice foraging opportunities caused 
by climate change. However, here we report gradual, moderate declines in activity and body 
temperature of both shore and ice bears in summer, resembling energy expenditures 
typical of fasting, nonhibernating mammals. Also, we found that to avoid unsustainable 
heat loss while swimming, bears employed unusual heterothermy of the body core. Thus, 
although well adapted to seasonal ice melt, polar bears appear susceptible to deleterious 
declines in body condition during the lengthening period of summer food deprivation. 



T he current rate of Arctic sea-ice loss, un- 
precedented in at least the past several 
thousand years, is outpacing predictions 
and accelerating (i). This raises concerns 
about the persistence of polar bears (Ursus 
nrnritimus) {2), which hunt on the surface of 
the sea ice, most successfully between April and 
July (3), when ringed seals {Pusa hispida) use 
this substrate for rearing pups and molting (4). 
Between August and October, hunting can be 
poor (5) as seals reduce ice surface time (4). Ad- 
ditionally, in about two-thirds of the polar bear 
range (d), seals become largely pelagic as ice 
retreats from the continental shelf (7, 8). Some 
polar bears spend this period on shore, where 
foraging is also usually limited (P). 

To reduce the loss of body condition during 
summer food deprivation, shore bears purport- 
edly enter a state of lowered activity and resting 
metabolic rate similar to winter hibernation but 
without denning {10). This “walking hiberna- 
tion” could partially compensate for the nega- 
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tive impacts of extended ice melt {IT). However, 
in western Hudson Bay, Canada, shore bears lose 
body mass at a rate indicative of typical, rather 



than hibemation-like, metabolism {12). The phys- 
iological state of bears that follow the retreating 
sea ice into the central Arctic basin in summer 
is unknown. In addition, recent sea-ice loss may 
be increasing the frequency of long-distance 
swims by polar bears {13), during which they 
risk losing over 10 times more heat than they 
produce (supplementary text) because their fur 
loses 90% of its insulation value when wet {14), 
and their subcutaneous fat does not provide 
blubber-like insulation {15). 

To understand polar bear responses to these 
challenges of summer ice melt, we investigated 
activity on shore (2008 and 2009) and on ice 
(2009) in the Beaufort Sea (Fig. 1) by affixing 
telemetry transmitters and activity loggers {16) 
to 25 females (mean age = 10 years + 1 SE, age 
range = 4 to 20 years) and one male (age 3). We 
recorded temperatures of the body core (an 
index of metabolic rate) {17) and periphery by 
implanting loggers into the abdomens (core) of 
10 bears (nine females, mean age = 11 years + 2 SE, 
age range = 3 to 23 years; one male, age 6) and 
the rumps (periphery) of seven other individ- 
uals (six females, mean age = 9 years + 2 SE, age 
range = 5 to 20 years; one male, age 2). 



Fig. 1. The western Arctic on (A) 11 May 2009 and (B) 31 August 2009. Locations are shown for ice 
polar bears (blue circles), shore polar bears (red squares), and whale carcasses (triangles). The 300-m 
depth contour is shown as a dashed line. Sea ice is shown in white. 
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Bears on shore and ice exhibited similar ac- 
tivity patterns (Fig. 2A and tables SI and S2; in 
table SI, shore data from 2008 and 2009 were 
pooled for lack of difference). Their time spent 
active peaked at -25% between May and July, 
then fell to 12 to 22% between August and 
October (Fig. 2A), remaining greater than the 
values of 1 to 2% previously observed during 
winter hibernation (18). The maximum activity 
level we measured (-25%) is approximately 
half that observed in all other bear species (19), 
perhaps reflecting polar bears’ specialization 
for hunting large vertebrate prey (3). The high 
energy costs of finding and subduing such prey 
can be reduced by ambush tactics (20), such as 
still-hunting at seal breathing holes (3). 

Although bears on shore and ice were simi- 
larly active between August and October, move- 
ment rates were higher for ice bears (Fig. 2B 
and table SI), potentially because the sea ice 
underfoot was drifting at 0.30 to 0.60 m/s (21). The 
proportion of rapid movement rates (>0.33 m/s) 
recorded when ice bears were resting (i.e., mo- 
tionless for >98% of the previous half hour) (16) 
increased in late summer (Fig. 2C), suggesting 
that as bears reduced their activity, their move- 
ment rates increasingly reflected ice drift. 

We did not find support for the hypothesis 
that lowered activity of shore and ice bears is a 
response to decreased food availability. For ice 
bears, location relative to primary seal habitat 
over the continental shelf (7, 8) was a poor pre- 
dictor of activity (mean model coefficients over- 
lapped with zero; n = 23 bears) (table S3). This 
result suggests that additional factors deter- 
mine seal distribution (e.g., fine-scale variation 
in productivity) (8) or their availability as prey 
(e.g., time spent on the ice surface) (4). Similar- 
ly, access to concentrated food resources for 
shore bears [i.e., locations within 500 m of bow- 
head whale (Balaena mysUcetus) carcasses from 
Inuit subsistence harvest (table S4)] was not 
associated with activity levels (n = 1 bears) 
(table S3). 

Bears may reduce activity to avoid heat stress 
in summer, because their large body size and 
low ratio of surface area to volume hinder heat 
dissipation. In a previous study, captive polar 
bears became hyperthermic while walking on 
treadmills at speeds >1.6 m/s when air temper- 
ature was >-5°C (22). However, air temperature 
in our study (daily means: -14.8° to 15.1°C) was 
unrelated to activity of shore bears (n = 12 bears) 
(table S3). Also, only 31 of our 61,882 measure- 
ments of movement rate (shore and ice com- 
bined; 72 = 25 bears) were >1.6 m/s, indicating 
that free-ranging bears seldom walk that rapid- 
ly. Thus, locomotion-induced heat stress is prob- 
ably rare and insufficient to explain the reduction 
in summer activity. 

Core body temperatures (monthly means of 
smoothed data) (16) did not differ between shore 
and ice bears during summer (Fig. 2D, table SI, 
and fig. SI), suggesting that all bears had similar 
nonhibemating, resting metabolic rates (fig. S2A). 
Mean core temperatures of ice bears gradually 
declined from May (37.3°C) to September (36.6°C) 



(Fig. 2D). It is unclear whether a similar trend 
occurred in shore bears, because they were im- 
planted with loggers in August. 

These gradual temperature declines corre- 
lated with activity (mean r = 0.31, n = 9 bears) 
(table S5) (16) and may have been associated 
with fasting (5). Fasting can cause progressive 
reduction in body temperature of -1°C in mam- 
mals (23, 24), corresponding with up to -20% 
decreases in whole-body metabolic rate (23-26). 
However, the reduction in mass-specific meta- 
bolic rate is smaller and sometimes nonexistent 
after the loss of metabolically active tissue is 
considered (24). Unfortunately, we were unable 
to assess mass changes because we captured 
bears in spring before they reached peak mass 
(table S6). Reduced insulation from thinning 
of fur and subcutaneous fat could also cause 
temperature declines, although warm summer 
conditions could counteract insulation loss. 
Hence, gradual declines in the summer core tem- 
perature of polar bears suggest reductions in 
energy expenditure typical of food-deprived mam- 
mals (24). 



Data from one pregnant female that retained 
her logger through January provide evidence that 
polar bears in maternal dens exhibit hibernation 
core temperatures in winter. In contrast to the 
gradual declines observed in summer, her core tem- 
perature abruptly fell to -35°C after 28 November 
(Fig. 2D and fig. S2B), as would be expected after 
parturition (3, 26). Such low temperature presum- 
ably reflects a 50 to 80% reduction in metabolic 
rate during winter hibernation, similar to that 
seen in other ursids (fig. S2A) (26, 27). 

Core temperatures, like movement rates, in- 
dicated that polar bears did not experience heat 
stress in summer. When walking on a treadmill 
at >1.6 m/s, captive polar bears frequently ex- 
hibited temperatures >39.0°C, including uncon- 
trolled rises to >40.0°C, leading to the conclusion 
that they store excess heat and are inefficient 
walkers (22). However, only 18 of our 27,843 mea- 
surements (n = 10 bears) were >39.0°C and none 
were >40.0°C. This suggests that polar bear heat 
storage and locomotion efficiency should be re- 
assessed and that polar bears thermoregulate 
effectively during summer. 
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Fig. 2. Polar bear activity and 
abdominal temperature. (A and B) 

Grand means (±SE) of activity and 
movements for bears on sea ice 
(circles) and on shore (squares), 
April to March (2008-2010). (C) 
Proportion of movement rates 
>0.33 m/s recorded when bears 
were inactive. (D) Individual means 
(±95% Cl) from May to October for 
ice bears (except location unknown 
for bear 20132) and from August to 
October for shore bears. The inset 
depicts grand monthly means from 
August to October on ice (circles) 
and on shore (squares). Sample 
sizes are in table S7 (n = 1 for July 
on shore): raw data are in fig. SI. 
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We also observed brief bouts of cold core 
temperatures (<35.0°C, typically <12 hours) as 
low as 22.3°C. These temperature changes were 
too rapid (up to +5.0°C per hour) to represent 
fluctuations in whole-body metabolic rate. Such 
bouts occurred in five of five shore bears and one 
of four ice bears. Similar peripheral temperature 
changes were recorded by rump loggers in six 
shore bears and one ice bear, although maximum 
hourly swings were greater for the periphery 
(means: +11.8°C and -10.4°C) than for the core 
(means: +5.0°C and -5.0°C) (supplementary text). 

It is unlikely that cold bouts of peripheral and 
core temperatures reflected consumption of ice 
or lying on it, as some occurred during above- 
freezing air temperature when sea ice was ab- 
sent. Instead, cold bouts appeared to be caused 
by two modes of regional heterothermy: (i) cool- 



Fig. 3. Body temperature of a 
swimming polar bear. (A) 

Hourly abdominal (core) tem- 
perature of bear 20414 in 2009. 
A bout of cold temperatures 
(arrow) is expanded in (B). 

(C) Temperatures from the col- 
lar and a weather station 110 km 
away during the same cold 
bout. (D) Collar acceleration 
scores during the same cold 
bout. (B) through (D) are 
divided into time periods 1 to 5. 
In period 1, stationary locations 
and high collar temperatures 
indicate resting on shore and 
covering the collar. In 2, loca- 
tions were offshore, collar tem- 
peratures were consistent with 
immersion, and acceleration 
suggests swimming. In 3 
through 5, locations indicate 
walking along the coast, resting, 
then walking again. (E) Mean 
activity (±95% Cl) measured 
during bouts of cold temper- 
atures (<35.0°C) recorded from 
abdominal (n = 69, three bears 
pooled) and rump (n = 259, one 
bear) loggers. (F) Mean collar 
temperature (±95% Cl) 
measured during bouts of cold 
temperatures (<35.0°C) 
recorded from abdominal 
(n = 68, three bears pooled) 
and rump (n = 829, six bears 
pooled) loggers. 



ing the body periphery during inactivity and (ii) 
cooling the periphery and part of the core while 
swimming (Fig. 3, A to D and fig. S3). Support- 
ing this distinction, peripheral temperatures fell 
below 35.0°C during both inactivity (49 of 800 mea- 
surements, n = 3 bears) and swimming (572 
of 741 measurements, n = 3 bears). In contrast, 
core temperatures fell below 35.0°C only during 
swimming (6 of 11 measurements, n = 1 bear) 
(supplementary text) and never during inactivity 
(0 of 353 measurements, n = 1 bear). Further- 
more, cold peripheral temperatures (<35.0°C) 
were associated with lower activity and warmer 
collar temperatures (i.e., when the bear curled 
up, warming the collar sensor) than were cold 
core temperatures (Fig. 3, E and F). 

Regional heterothermy of the body core is 
unusual (17) and may minimize heat loss while 



swimming. Immersed polar bears probably re- 
duce skin temperature to several degrees above 
the surrounding water (supplementary text) (75), 
similar to seals (28). Our data suggest that bears 
maintain an internal thermal gradient by tem- 
porarily cooling the outermost tissues of their 
core to form an insulating shell. A similar pro- 
cess occurs in diving king penguins (Aptenodytes 
patagoniciis), another polar endotherm without 
blubber (29). Control of this process is probably 
active (e.g., via vasoconstriction), because ab- 
dominal loggers cooled more quickly than is 
possible passively, and temperatures were sub- 
sequently regulated (Fig. 4 and supplementary 
text). This regional heterothermy may represent 
an adaptation to long-distance swims (13), al- 
though its limits remain unknown. One of our 
bears survived a 9-day swim, but when recap- 
tured 7 weeks later, she had lost 22% of her 
body mass and her cub (30). 

Sea-ice loss (1) increasingly limits spring and 
summer hunting opportunities for polar bears 
in parts of their range (2). In the Beaufort Sea 
and elsewhere (2, 31), this has reduced the en- 
ergy stores available for bears during subse- 
quent food deprivation (2, 5). We found that 
both core temperature and activity remained 
above values observed during winter hiberna- 
tion. The gradual declines in core temperature 
during summer suggest a typical mammalian 
response to fasting, which offers limited to no 
energy savings based on mass-specific meta- 
bolic rates (24). Thus, our data indicate that 
bears cannot use a hibernation-like metabolism 
to meaningfully prolong their summer period 
of fasting and reliance on energy stores. In 
conjunction with theoretical models linking nor- 
mal metabolic rate to depletion of stored energy 
and mortality (32), our findings suggest that 
bears are unlikely to avoid deleterious declines 
in body condition, and ultimately survival, that 
are expected with continued ice loss and length- 
ening of the ice melt period (2). 




Fig. 4. Cooling curves of a polar bear. Hourly ab- 
dominal (core) temperatures [06:00 (UTC -08:00) 
on 2 October to 18:00 on 3 October 2009] represent 
swimming based on sparse location data (solid cir- 
cles). Predicted intraperitoneal temperatures (open 
circles) represent cessation of heat production 
(death) at the asterisk and subsequent immer- 
sion in 4°0 water (supplementary text). 
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THERMAL PHYSIOLOGY 

Keeping cool: Enhanced optical 
reflection and radiative heat 
dissipation in Saharan silver ants 

Norman Nan Shi,^ Cheng-Chia Tsai/ Fernando Camino,^ Gary D. Bernard/ 

Nanfang Yu/* Rudiger Wehner^* 

Saharan silver ants, Cataglyphis bombycina, forage under extreme temperature conditions 
in the African desert. We show that the ants’ conspicuous silvery appearance is created 
by a dense array of triangular hairs with two thermoregulatory effects. They enhance not 
only the reflectivity of the ant’s body surface in the visible and near-infrared range of 
the spectrum, where solar radiation culminates, but also the emissivity of the ant in 
the mid-infrared. The latter effect enables the animals to efficiently dissipate heat back 
to the surroundings via blackbody radiation under full daylight conditions. This biological 
solution for a thermoregulatory problem may lead to the development of biomimetic 
coatings for passive radiative cooling of objects. 



T he silver ants of the Sahara desert, 
Cataglyphis bombycina, inhabit one of 
the hottest terrestrial environments on 
Earth, where they occupy the ecological 
niche of a “thermophilic scavenger” (7). In 
wide-ranging foraging journeys, they search for 
corpses of insects and other arthropods that have 
succumbed to the thermally harsh desert condi- 
tions, which they themselves are able to with- 
stand more successfully. On hot summer days, 
they may reach maximal foraging activities when 
temperatures of the desert surface are as high as 
60° to 70°C and their body temperatures measured 
as “operative environmental temperatures” are in 
the range of 48° to 51°C (2, 3). In order to survive 
under these conditions, occasionally the ants 
must unload excess heat by pausing on top of 
stones or dry vegetation, where, because of the 
steep temperature gradient above the sand sur- 
face, they encounter considerably lower tem- 
peratures. Under the midday sun of a summer 
day, the ants may resort to this thermal respite 
(cooling ofO up to 70% of their entire foraging 
time (3). In keeping their body temperature 
below their critical thermal maximum of 53.6°C 
(4), they need not only to reduce heat absorp- 
tion from the environment but also to be able to 
efficiently dissipate excess heat, so that they can 
minimize the amount of time spent in thermal 
refuges. 

As we showed, through a series of optical and 
thermodynamic measurements, full-wave simu- 
lations, and heat-transfer modeling, a dense array 
of triangularly shaped hairs, characteristic of 
(Cataglyphis bowJyycina, enables the ants to main- 
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tain lower body temperatures by (i) reflecting a 
large portion of the solar radiation in the visible 
and near-infrared (NIR) range of the spectrum 
and (ii) radiating heat to the surrounding envi- 
ronment by enhancing the emissivity in the mid- 
infrared (MIR), where the blackbody radiation 
spectrum of the ant’s body culminates. The thermo- 
regulatory solutions that the silver ants have 
evolved to cope with thermally stressful condi- 
tions show that these animals are able to con- 
trol electromagnetic waves over an extremely 
broad range of the electromagnetic spectrum 
(from the visible to the MIR) and that different 
physical mechanisms are employed in different 
spectral ranges to realize an important biological 
function. 

Specimens of Cataglyphis borribycina collected 
in Tunisia (34°10'N, 08°18'E) were used for all of 
the optical and thermodynamic measurements. 
In these ants, the dorsal and lateral sides of the 
body have a silvery glare (Fig. lA) and are cov- 
ered by dense and uniform arrays of hairs (Fig. IB 
and fig. S4). As scanning electron microscopy 
(SEM) images show, the hairs, which gradually 
taper off at the tip, are locally aligned in the same 
direction (Fig. 1C). Their most remarkable struc- 
tural feature is the triangular cross-section char- 
acterized by two corrugated top facets and a flat 
bottom facet facing the ant’s body (Fig. 1, D and 
E). Cross-sectional views obtained by focused ion 
beam (FIB) milling show that the gap between 
the bottom hair facet and the cuticular surface 
also varies but is generally larger than a few 
hundred nanometers. 

Optical reflectivity measurements of ant speci- 
mens were obtained with two Fourier transform 
spectrometers, one collecting spectra in the visible 
and NIR (wavelengths from 0.46 to 1.7 pm) and the 
other in the MIR (wavelengths from 2.5 to 16 pm). 
The visible and NIR measurements showed 
that hemispherical reflection [i.e., the sum of 
specular and diffuse reflection collected through 
an integrating sphere (2)] is substantially enhanced 
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in regions with intaet hair coverage as compared to 
regions from which the hairs had been removed 
(Fig. 2A and fig. SI). The hair-covered region re- 
flects 67% of the incoming solar radiation rather 
than only 41%, as is the case after their removal. 
This enhancement is due to scattering within the 
triangular hairs (Mie scattering), where light gets 
trapped and then reradiates out in all directions 
(5-7). Individual hairs of given cross-sectional di- 
mensions generate enhanced reflection due to 
scattering at specific wavelengths where funda- 
mental and higher-order Mie resonance modes 
are supported (Fig. 2C and fig. S5). Due to the 
variation in cross-sectional areas, resonance peaks 
from individual hairs are averaged out, so that 
the hair cover effectively acts as a coating with 
enhanced broadband reflection. 

Because of the ellipsoidal shape of the ant’s 
body, about two-thirds of the dorsolateral surface 
is obliquely hit by solar radiation (fig. S4). This 
prompted us to examine the reflectivity as a 
function of the incidence angle of radiation, which 
was varied from 0° to 80° with 0° representing 
the direction normal to the surface. As the results 
show, Mie scattering enhances reflectivity over all 



angles when regions with intact hair cover are 
compared to those with hairs removed (Fig. 2B). 
With increasing angle of incidence, reflectivity 
enhancement becomes particularly strong at be- 
yond 30°. This is the critical angle at which total 
internal reflection starts to occur at the bottom 
facets of the hairs (Fig. 2D, II). At angles ap- 
proaching 90° reflectivity drops off when total 
internal reflection at one of the top facets starts 
to direct more of the radiation toward the ant’s 
body (Fig. 2D, III). 

Next, we performed finite-difference time- 
domain (FDTD) simulations in order to demon- 
strate the functional significance of the triangular 
cross section of the hairs in enhancing reflectivity 
in the visible and NIR ranges (figs. S5 to S8). 
These simulations compared the reflective prop- 
erties of triangular and circular hairs of the same 
cross sectional area. Even though the enhance- 
ment of reflectivity at normal incidence is com- 
parable in both cases, triangular hairs produce 
an extra enhancement at higher angles of inci- 
dence (Fig. 2B). The reason is that although Mie 
scattering of similar strength occurs in both cir- 
cular and triangular hairs, in the latter the total 



internal reflection at the bottom facets of the 
hairs enhances reflectivity substantially further. 
The high reflectivity disappeared when the spec- 
imens were wetted by an ethanol-water solution 
(fig. S3), which removed the refractive index con- 
trast between air and hairs and thus destroyed 
the conditions required for Mie scattering and 
total internal reflection. 

Reflectivity measurements performed in the 
MIR range revealed a second important point 
in the silver ant thermotolerance story. When 
proceeding from lower to higher wavelengths, at 
about 8 pm, the enhanced reflectivity of regions 
with hair cover as compared to those without 
hairs reverses to reduced reflectivity (Fig. 2E). 
As Kirchhoffs law of thermal radiation states, re- 
duced reflectivity corresponds to enhanced emis- 
sivity ( 8 ). At a body temperature of 50°C, which 
the silver ants may reach when foraging at peak 
activity times, the blackbody radiation of the 
ant’s surface would lie in the range of 6 to 16 pm 
(peaked at ~9 pm) and thus allow the silver ants 
to offload heat more efficiently through radiative 
heat transfer. The latter decreases the steady-state 
body temperature and reduces the respite time. 




Fig. l.The bright glare of the silver ant and its structural basis. (A) Silver ant offloading heat on top of dry vegetation ( 18 ). (B) SEM frontal view of the head 
densely covered by hairs. (C) SEM image of the hairs gradually tapering off toward the tip. (D) Cross-sectional view of the hairs milled with FIB. (E) SEM image of 
two hairs with one flipped upside down to exhibit the flat bottom facet. 
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Fig. 2. Reflectivity of the silver ant’s body sur- 
face from the visible to the MIR range of the 
spectrum. (A) Hemispherical reflectivity measured 
in the visible and NIR. (B) Measurement and sim- 
ulation results showing visible and NIR reflectivity as 
a function of incidence angle. (C) Cross-sectional 
view of a two-dimensional distribution of a light field 
(magnitude of electric field component of light, or 
|E|) around a triangular hair for three exemplary Mie 
resonances. (D) Schematic diagram showing the in- 
teraction between visible and NIR light and a hair at 
small (I), intermediate (II), and large (III) incidence 
angles. The corrugated upper two facets may en- 
hance diffuse reflection in the ultraviolet and visible 
ranges. (E) Reflectivity measured in the MIR at normal 
incidence. (F) Simulated MIR reflectivity as a function 
of incidence angle. 
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How does the hair cover with its enhanced 
reflectivity in the visible and NIR and its enhanced 
emissivity in the MIR affect the radiative heat 
transfer between the ant’s body and the environ- 
ment? To investigate this question, we performed 
thermodynamic experiments, which mimicked all 
radiative heat transfer effects in the silver ants’ 
natural foraging environment (fig. S2). To accom- 
plish this task, we used a high-power xenon lamp 
to simulate the solar spectral distribution at the 
desert sand surface (9) and a thermoelectrically 
cooled high-emissivity metal plate to simulate the 
clear sky with its low level of blackbody radia- 
tion (10). The ant specimens were suspended on 
thin threads to minimize thermal conduction. 
Thermodynamic experiments were conducted in 
vacuum to study thermal radiation, as well as 
in still air to study the interplay of thermal ra- 
diation and convection. Under both conditions, 
the specimens with their natural hair covers 
were able to maintain significantly lower steady- 
state body temperatures than the same speci- 
mens with the hairs removed (Fig. 3). 

The thermodynamic experiments conducted 
in vacuum comparing specimens before and af- 
ter hair removal further revealed that the hair 



cover decreases the time constants of temperature 
change (Fig. 3, B and E). The shortened time 
constants indicate an increased rate of radiative 
heat transfer and are a direct confirmation of the 
effect of the hairs in enhancing the MIR emis- 
sivity. By using the time constants and the heat 
transfer model, we computed that the hair cover 
enhances emissivity by about 15% (table SI). This 
enhanced emissivity is due to the fact that at 
large MIR wavelengths (i.e., at wavelengths much 
larger than the dimensions of the cross sections 
of the hairs), the hair structure acts as a gradient 
refractive index layer (fig. S9) (11-13), which pro- 
vides the surface with broadband, broad-angle 
antireflective properties in the MIR (Fig. 2, E and 
F). Because of the influence of convection, the 
time constants of temperature change decreased 
by a factor of about 3 when the specimens were 
brought from vacuum into air (Fig. 3). This in- 
dicates that radiative heat dissipation amounted 
to about one-half of convection and, therefore, 
still played a significant role in the presence of 
natural convection. 

Applying experimentally extracted parameters 
to the heat transfer model revealed that the en- 
hanced visible and NIR reflectivity and enhanced 



MIR emissivity make comparable contributions 
to reducing the steady-state temperature in the 
presence of natural convection (figs. SIO and Sll). 
On hot summer days in the Sahara, the foraging 
activities of silver ants often occur under low wind 
or even still air conditions, when the ants have to 
rely on enhanced visible and NIR reflectivity and 
enhanced MIR emissivity equally heavily to re- 
duce their body temperature during the respite 
behavior. 

It is interesting to note that the hairs cover 
only the top and the sides of the ant’s body, where 
they are responsible for the effects described 
above. The absence of hairs on the bottom sur- 
face reduces the radiative energy transfer between 
the hot sand and the cooler ant body, so that the 
animals can reduce the absorption of blackbody 
radiation from the desert floor. 

In conclusion, Saharan silver ants are covered 
with a dense array of triangular hairs on the top 
and sides of their bodies. These silvery hairs 
protect the ants against getting overheated in at 
least three ways. First, as a result of Mie scat- 
tering and total internal reflection, the hairs 
enhance reflectivity in the visible and NIR, where 
solar radiation culminates. Second, in the MIR, 
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Fig. 3. Results of thermodynamic experiments. 

(A) Thermal camera images showing the head of 
an ant specimen at the thermal steady state un- 
der different conditions. Temporal temperature 
profiles measured for the head before and after 
hair removal in vacuum (B) and in still air (C) are 
shown. (D to F) Results obtained for the hind part 
(gaster) of an ant specimen. Insets in (B) and (E) 
are photos of specimens before and after hair 
removal. In the “hairs intact” pictures of head and 
gaster, because of the limited solid angle of illu- 
mination, the silvery glance is not shown all over 
the body surface portrayed in the figures. 



where solar radiation becomes negligible for 
wavelengths >2.5 gm, the hairs acting as an 
antireflection layer enhance emissivity and thus 
increase the ants’ ability to offload excess heat via 
blackbody radiation. Third, the ants’ bare bottom 
surface reflects MIR radiation from the hot des- 
ert floor more efficiently than if it were covered 
by hairs. Taken together, these effects result in 
decreasing the ants’ steady-state body temperature 
and thus enable these thermophilic scavengers to 
forage at exceedingly high environmental tem- 
peratures. Finally, the present interdisciplinary 
account on the silver ants could have a signifi- 
cant technological impact by inspiring the de- 
velopment of biomimetic coatings for the passive 
cooling of objects ( 14 - 16 ). A recent article re- 
ported the demonstration of a multilayered film 
that can cool down an object by using essentially 
the same mechanisms as the silver ants: high re- 
flectivity in the solar spectrum and high emissivity 
in the MIR ( 17 ). 
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PLANT ECOLOGY 

Worldwide evidence of a unimodal 
relationship between productivity 
and plant species richness 
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The search for predictions of species diversity across environmental gradients has 
challenged ecologists for decades. The humped-back model (HBM) suggests that plant 
diversity peaks at intermediate productivity; at low productivity few species can tolerate 
the environmental stresses, and at high productivity a few highly competitive species 
dominate. Over time the HBM has become increasingly controversial, and recent studies 
claim to have refuted it. Here, by using data from coordinated surveys conducted 
throughout grasslands worldwide and comprising a wide range of site productivities, we 
provide evidence in support of the HBM pattern at both global and regional extents. The 
relationships described here provide a foundation for further research into the local, 
landscape, and historical factors that maintain biodiversity. 



D espite a long history of research, the nature 
of basic patterns between environmental 
factors and biological diversity remain 
poorly defined. A notable example is the 
relationship between plant diversity and 
productivity, which has stimulated a long-running 
debate (1-6). A classic hypothesis, the humped- 
back model (HBM) (7), states that plant species 
richness peaks at intermediate productivity, tak- 
ing above-ground biomass as a proxy for annual 
net primary productivity (7-9). This diversity peak 
is driven by two opposing processes. In unproduc- 
tive ecosystems with low plant biomass, species 
richness is limited by abiotic stress, such as in- 
sufficient water and mineral nutrients, which 
few species are able to tolerate. In contrast, in the 
productive conditions that generate high plant 
biomass, competitive exclusion by a small num- 
ber of highly competitive species is hypothesized 
to constrain species richness (7-9). Other mech- 
anisms that may explain the unimodal relation- 
ship between species richness and biomass include 
disturbance (7, 10), evolutionary history and dis- 
persal limitation (11, 12), and the reduction of total 
plant density in productive communities (13). 

Since its initial proposal, a range of studies 
have both supported and rejected the HBM, and 
three separate meta-analyses reached different 



conclusions (14-17). Although this inconsistency 
may indicate a lack of generality of the HBM, it 
may instead reflect a sensitivity to study meth- 
odology, including the plant community types 
considered, the taxonomic scope, the range of 
site productivities sampled, the spatial grain and 
extent of analyses (17, 18), and the particular 
measure of net primary productivity used (19). 
The questions therefore remain open as to what 
the form of the relationship between diversity 
and productivity is, and whether the HBM serves 
as a useful and general model for grassland eco- 
system theory and management. 

We quantified the form and the strength of the 
richness-productivity relationship by using glob- 
ally coordinated surveys (20), which yielded scale- 
standardized data and were distributed across 
30 sites in 19 countries and six continents (Fig. 1). 
Collectively, our samples spanned a broad range 
of biomass production (from 2 to 5711 g m“^) and 
grassland community types, including natural and 
managed (pastures and meadows) grasslands 
over a v^de range of climatic zones (temperate, 
Mediterranean, and tropical), and altitudes Qow- 
land to alpine) (table SI). Our protocol involved 
sampling 64 1-m^ quadrats vathin 8-m-by-8-m 
grids (18, 21). At each site, between 2 and 14 grids 
were sampled, thus resulting in 128 to 896 quad- 



rats per site. In each 1-m^ quadrat, we identified 
and counted all plant species and harvested above- 
ground biomass and plant litter. Litter production 
is a function of annual net primary productivity 
in grasslands and can have profound effects on 
the structure and functioning of communities, 
from altering nutrient cycling to impeding vege- 
tative growth and seedling recruitment (22, 23), 
thereby also playing a major role in driving com- 
munity structure. Indeed, the HBM was origi- 
nally defined in terms of live biomass plus litter 
material (7, 8). Most of the sites in our survey 
were subject to some form of management, usually 
livestock grazing or mowing. In this respect, 
our sites are representative of most of the world’s 
grasslands. Our sampling was conducted at 
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least 3 months after the last grazing, mowing, or 
burning event and at the annual peak of live bio- 
mass, which, when coupled with litter, constitutes 
a reliable measure of annual net aboveground pro- 
duction in herbaceous plant communities (24). 

Our results strongly support the HBM of the 
plant richness-productivity relationship. By using 
a global-extent regression model (N = 9631 1-m^ 
quadrats) (21), we found that plant richness 
formed a unimodal relationship with productivity 
(Fig. 2A) that is characterized by a highly sig- 
nificant concave-down quadratic regression [neg- 
ative binomial generalized linear model (GLM); 
Table 1]. This relationship was not sensitive to 
the statistical model used; the hump-backed re- 
lationship was also evident when we used a neg- 
ative binomial generalized linear mixed model 
(GLMM) that accommodated the hierarchical 
structure of our sampling design (grids nested 
within sites; Table 1 and fig. SI). 

At a sampling grain of 1 m^, 19 of 28 site level 
analyses (68%) yielded significant concave-down 
relationships (table S2 and Fig. 2A). This con- 
trasts markedly with the results of Adler et al (1), 
who found only 1 of their 48 within-site analyses 
to be significantly concave-down. We also found 
the form of the productivity-diversity relation- 
ship to be robust to sampling grain: by using 
grains of 1 m^ up to 64 m^, each time main- 
taining a global extent, we consistently found a 
significant concave-down relationship, although 
the proportion of variation explained tended to 
decrease with increasing grain (fig. S2). 

The HBM predicts a boundary condition or 
upper limit to diversity that, in any given site. 



may not be realized for a variety of reasons (18). 
Consistent with this view, our global-extent as- 
sociation is characterized by a significant concave- 
down quantile regression (95th percentile) (Table 
1), below which considerable scatter exists (Fig. 
2A). This pattern was also insensitive to the 
statistical method used; a hierarchical Bayesian 
analysis that accommodated the nested sampling 
design and that enabled both the mean and the 
variance of species richness to be modeled more 
accurately against Gog-transformed) biomass also 
revealed a significant 95th percentile quantile 
regression (fig. S3). Likewise, we found a signifi- 
cant, concave-down quantile regression (95th per- 
centile) between the maximum (quadrat-scale) 
richness found within a grid and the total biomass 
of the same quadrat (Table 1 and fig. S4). Each of 
these approaches to characterizing boundary con- 
ditions suggests the existence of a “forbidden 
space,” wherein high productivity precludes high 
local diversity. Furthermore, they suggest that 
extremely low-productivity sites rarely accommo- 
date high diversity. 

Why do our data show a hump-backed rela- 
tionship, whereas those of Adler et al. (1) and 
related studies (4, 6), do not? One possibility is 
that data limitations can thwart detection of the 
HBM (18). For example, the data used by Adler 
et al. differed from ours in the following poten- 
tially important ways: (i) They exhibited a maxi- 
mum live biomass of only 1535 g"^ (ours was 3374 g"^), 
(ii) litter was not included within the calculation 
of biomass, and (iii) sample size was limited to 30 
quadrats per site (ours ranged from 128 to 894 
quadrats per site; table SI). We conducted a form 



of sensitivity analysis in which we reran our sta- 
tistical analyses using random subsets of our 
data that were constrained to exhibit similar 
properties to those of the Adler et al. data set. 
Specifically, after limiting the overall data set to 
less than 1535 g"^ and excluding litter, we ran- 
domly selected 30 quadrats per site 500 times, 
each time conducting the within-site regression 
analyses (N = 30 for each of the 28 site-level 
GLMs conducted per subsampling iteration). For 
each iteration, we also calculated the average 
range of biomass spanned by the 28 site-level 
relationships. Across the 500 iterations (one 
example set of outcomes is shown in Fig. 2B), 
the average proportion of significant concave- 
down, within-site regressions was 0.31 + 0.003 
(SEM), significantly less than our observed pro- 
portion of 0.68 (fig. S5). Moreover, when signif- 
icant concave-down relationships were detected, 
they tended to span a broader range of biomass 
than the remaining forms (including nonsignifi- 
cant relationships). Specifically, in 458 of the 500 
iterations (92%), the mean biomass range of the 
concave-down regressions was larger than the 
mean of the remaining forms’ biomass ranges 
(binomial test: P < 2.2 x 10"^^). Last, the 48 
within-site analyses of Adler et al. spanned, on 
average, a live biomass range of 428.7 g"^ + 38.36 
(range of 89 to 1217 g"^). This is (i) less than half 
of the average range encompassed by our 28 site- 
level analyses shown in Fig. 2A (mean = 1067.5 g"^ + 
140.63; range of 286 to 3256 g"^) and (ii) almost 50% 
narrower than the smallest average biomass range 
encompassed by our 500 random subset analyses 
(627.4 g"^) (fig. S6). Taken together, these findings 




Fig. 1. Site locations. Locations of the geographic centroids of the 30 study sites, which include 151 sampling grids. Some points overlap and are therefore 
indistinguishable. Additional site details are provided in table SI. Map is displayed using the Robinson projection. 
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Strongly suggest that we were able to detect 
more concave-down relationships because of the 
greater sample sizes and biomass ranges in our 
analysis. 

It has been suggested (2) that some previous 
studies, including Adler et al (1), failed to support 
the HBM because they excluded litter. Although 
we do find a significant concave-down relation- 
ship at the global extent using only live biomass 
(Table 1), a comparison of models using biomass 
versus biomass and litter (both N = 9,631) shows 



total biomass to provide a far better fit [residual 
deviance = 10,105 Give) versus 10,037 (total); Vuong 
.8:-statistic for comparing non-nested models: -13.4; 
P < 0.001]. It has also been suggested that previ- 
ous surveys failed to adequately represent high- 
productivity communities. Indeed, our high-biomass 
quadrats (1011 samples were over 1000 g"^, -10% 
of the 9631 samples; maximum of 5711 g"^) con- 
tributed considerably to the right-hand part of 
the fitted humped-back regression. This could be 
a reason why the data set of Adler et al. (7) (in 



which only 0.5% of samples were over 1000 g"^ 
with a maximum of 1534 g“^) failed to support 
the HBM. Our results therefore show that a test 
of the HBM in herbaceous plant communities 
yields the expected pattern when it is robust and 
comprehensive, spans a wide range of biomass 
production (from 1 to at least 3000 dry g"^ year“^), 
and provides sufficient replication of quadrats 
along the productivity gradient. 

Competitive exclusion has been cited as the 
primary factor driving low species numbers at 




Fig. 2. Biomass production as a function of species richness. (A) 

Biomass production-species richness relationships for 28 study sites. 
Solid black line indicates significant quantile regression (95th percentile) 
of overall relationship (quadratic coefficient P < 0.001; N - 9631 quadrats). 
Dashed black line, significant negative binomial GLM (quadratic coefficient 
P < 0.001; N = 9631). Colored lines indicate significant GLM regressions 
(Poisson or quasi-Poisson), with N ranging from 128 to 894 quadrats. 



(Inset) The frequencies of each form of relationship observed across study 
regions. NS, not significant. (B) Same as (A) but the results are derived 
from the analysis of an example, random subsample of the complete data 
set that satisfies the following criteria; litter biomass excluded, quadrats with 
biomass >1534 g~^ excluded, and including 30 (randomly selected) quadrats 
per site (total N - 840). These criteria match the characteristics of the data 
set used by Adler et al. (J). 



Table 1. Regression results. Results of regression analyses of the relationship between productivity and species richness, measured at a global extent and a 
sampling grain of 1-m^ quadrat. Total biomass = live biomass -i- litter biomass. All linear and quadratic term coefficients were highly significant (P < 0.001). 



Productivity measure 


Type of regression 


Sample size 


Test of model fit 


Intercept 


Linear term 


Quadratic term 










estimate + SEM 


coefficient + SEM 


coefficient + SEM 


Total biomass 


negative binomial 
GLM (log-link function) 


9631 

quadrats 


likelihood ratio 
stat. = 1602.2 


-2.52 ± 0.235 


4.69 ± 0.186 


-1.04 ± 0.037 


Total biomass 


negative binomial 


9631 












GLMM (log-link function) 
random effects: 


quadrats 
151 grids 


likelihood ratio 
stat. = 114.0 


0.91 ± 0.191 


1.33 ± 0.133 


-0.29 ± 0.028 




grid nested in site 


28 sites 










Total biomass 


quantile 

(95th percentile) 


9631 quadrats 


pseudo-F 
statistic = 179.1 


-12.9 ± 7.159 


45.6 ± 5.833 


-11.3 ± 1.173 


Live biomass 


negative binomial GLM 
(log-link function) 


9644 quadrats 


likelihood ratio 
stat. = 950.3 


-2.03 ± 0.212 


4.27 ± 0.178 


-0.96 ± 0.037 
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high plant biomass (7, 8, 2S). However, in the 
case of nitrogen addition the negative relation- 
ship between productivity and species richness 
has been shown to diminish over time \_(26), but 
see (27, 28)]. It may be that low species richness 
in high-productivity conditions arises in part be- 
cause most such habitats are anthropogenic, and 
there are few species in the local pool adapted to 
these conditions (11, 12). If so, it is possible that 
species will eventually immigrate from distant 
pools, so that the right-hand part of the hump 
will then flatten out. 

We have shown a global-scale concave-down 
unimodal relationship between biomass produc- 
tion and richness in herbaceous grassland com- 
munities. However, the original HBM (7) is vaguely 
articulated by the standards of modem ecological 
theory, and it is clear that more work is needed 
to determine the underlying causal mechanisms 
that drive the unimodal pattern (1, 6, 17, 18). We 
recognize that, in our study and many others, 
productivity accounts for a fairly low proportion 
of the overall variation in richness and that many 
other drivers of species richness exist (28-30). 
Accordingly, we echo the call of Adler et al. (1) for 
additional efforts to understand the multivariate 
drivers of species richness. 
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ICE SHEETS 

Reverse glacier motion during 
iceberg calving and the cause of 
glacial earthquakes 

T. Murray/* M. Nettles,^ N. Selmes/ L. M. Cathles/ J. C. Burton,^ T. D. James/ 

S. Edwards/ I. Martin/ T. O’Farrell/ R. Aspey/ I. Rutt/ T. Bauge’^ 

Nearly half of Greenland’s mass loss occurs through iceberg calving, but the physical mechanisms 
operating during calving are poorly known and in situ observations are sparse. We show that 
calving at Greenland’s Helheim Glacier causes a minutes-long reversal of the glacier’s horizontal 
flow and a downward deflection of its terminus. The reverse motion results from the horizontal 
force caused by iceberg capsize and acceleration away from the glacier front. The downward 
motion results from a hydrodynamic pressure drop behind the capsizing berg, which also causes 
an upward force on the solid Earth. These forces are the source of glacial earthquakes, globally 
detectable seismic events whose proper interpretation will allow remote sensing of calving 
processes occurring at increasing numbers of outlet glaciers in Greenland and Antarctica. 



O ne-third to one-half of Greenland’s total 
mass loss occurs through iceberg calving 
at the margins of tidewater-terminating 
glaciers (1, 2). Recent rapid changes in glacier 
dynamics are associated with increased 
calving rates (3-5) and increased rates of glacial 
earthquakes (6). At laige glaciers with near-grounded 
termini, calving typically occurs when buoyancy 
forces cause icebergs that are the full thickness of 
the glacier to capsize against the calving front (6-9). 
This type of calving is associated with glacial 
earthquakes (6, 7, 10), long-period seismic emis- 
sions of magnitude ~5 that are observed globally 
(11). These earthquakes have expanded north- 
ward and increased sevenfold in number during 



the past two decades (6, 12, 13), tracking changes 
in glacier dynamics, the retreat of glacier fronts, 
and increased mass loss (6, 14). Buoyancy-driven 
calving represents an increasingly important source 
of dynamic mass loss (6-8) as glacier fronts through- 
out Greenland have retreated to positions near 
their grounding lines (75). However, because of 
the difficulty of instrumenting the immediate near- 
terminus region of these highly active glaciers, few 
direct observations of the calving process are avail- 
able, limiting development of the deterministic 
calving models required for improved understand- 
ing of controls on dynamic ice-mass loss. Detailed 
knowledge of the glacial earthquake source would 
allow quantification of calving processes for a large 
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class of Greenland glaciers, as well as for glaciers in 
several regions of Antarctica (13). 

Agreement on the source mechanism of glacial 
earthquakes is limited. Analysis of long-period 
seismic data shows that a sub-horizontal force 
acts approximately perpendicular to the glacier 
calving front during the earthquakes (6, 13). The 
observed seismic signal is generated over a pe- 
riod of 1 min or more (6, 11, 16), much longer than 
the source duration for tectonic earthquakes of 
similar size (17). Some authors favor a model in 
which momentum transfer produces a force acting 
in the upglacier and then downglacier directions as 
a newly calved iceberg overturns, accelerates away 
from the calving front, and subsequently deceler- 
ates (6, 10, 13, 18). Others suggest that the seismic 
signal arises from the iceberg scraping along the 
calving front or :Qord bottom (7) or colliding with 
the glacier terminus (19). Hydrodynamic interac- 
tions with flord water may be important (20) but 
are little explored. Analytical investigations admit 
more than one possible mechanism for the earth- 
quakes (18), and no persuasive explanation has 
been presented for the vertical component of the 
earthquake force. We combined geodetic, seismic, 
and laboratory data to identify the forces acting 
during calving at large glaciers and to document 
the source of the associated seismic signals. 

We recorded geodetic data at the calving margin 
of Helheim Glacier (Fig. 1) (9), a major outlet of 
the Greenland Ice Sheet, during 55 days in July- 
September 2013. A wireless network of on-ice 
Global Positioning System (GPS) sensors (21) cap- 
tured glacier motion with centimeter-level accu- 
racy at a high temporal sampling rate in positions 
very close to the calving front (22). Hourly images 
from two cameras located ~4 km down-^ord from 
and looking at the calving front were used in stereo 
configuration to obtain the three-dimensional 
geometry of the calving front and calved icebergs 
(8, 22). Data from the global seismographic net- 
work were analyzed for the same time period to 
identify glacial earthquakes (13, 23) and obtain 
source parameters (11), including the orientation 
of the force active during the earthquake and the 
amplitude and centroid time, tc, of a centroid- 
single-force (CSF) history of prescribed shape (22). 

The glacier retreated -1.5 km in a series of 
calving events during the observing period. We 
identified 10 large calving events from the camera 
images. All coincided with glacial earthquakes; in 
two cases, two earthquakes occurred between sub- 
sequent images. During the earthquakes, the region 
near the calving front showed a dramatic reversal 
of flow, moving upglacier for several minutes 
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while simultaneously moving downward (Fig. 2 
and fig. SI). The horizontal and vertical motion 
then rebounded rapidly. 

Observations from a glacial earthquake occurr- 
ing on day of year (DOY) 206 at 03:13:47 UTC are 
shown in Fig. 2, A and C. Analysis of camera im- 
ages indicates ice loss of 0.461 + 0.009 km^ (Fig. 
1) at a location of ice thickness 0.79 km, yielding 
an iceberg volume of 0.36 km^ with an aspect ratio 
of 0.23. The earthquake had a CSF amplitude of 
0.24 X 10^"^ kg-m, with the force oriented 64°W 
(Fig. 1) and 9° above horizontal. GPS sensor 
1 (Fig. 1) showed a pre-earthquake flow speed 
of 29 m/day. Immediately before the earthquake 
centroid time, the sensor reversed its direction and 
moved upglacier at -40 m/day (displacement = 
9 cm) and downward (displacement = 10 cm). The 
reversed motion was sustained for -200 s and was 
followed by a downglacier rebound at -190 m/day 
(displacement = 20 cm) and upward movement 
(displacement = 16 cm) for -90 s. Similar tem- 
porally coincident signals were detected by near- 
by sensors 6 and 15 (Fig. 1 and fig. SI). 

Glacier deflection for a calving event on DOY 
212 (Fig. 1) is shown in Fig. 2, B and D. We ob- 
served similar responses for all glacial earthquake- 
iceberg calving events during which GPS sensors 
recording data of adequate quality were located 
within 500 m of the calved block (a total of nine 
glacial earthquakes and eight image pairs). These 
events occurred on DOY 205, 206 (three events), 
207, 211, 212, and 226 and were detected by multi- 
ple GPS sensors (further examples in fig. SI). 

The earthquake centroid times occurred at or 
near the end of the glacier’s rapid rebound phase, 
such that the upglacier earthquake force aligned 
in time with the reverse motion of the glacier. 
The horizontal glacier deflection is consistent 
with a model in which the reaction force on the 
glacier caused by seaward acceleration of the 
newly calved iceberg compresses the glacier front 
elastically. The front then rebounds as the force 
decreases and reverses polarity during iceberg 
deceleration. The glacier front thus acts as a 
spring, compressing and re-extending in phase 
with the applied force, which is the horizontal 
component of the seismic source. 

The downward deflection of the glacier front 
occurred in a region where vertical motion of the 
GPS sensors at tidal frequencies showed that the 
glacier is ungrounded and seawater is present 
beneath it. Iceberg rotation is likely to cause a 
low-pressure zone in the opening cavity between 
the iceberg and the glacier front. This pressure 
decrease would lower the load on the bedrock, 
resulting in an upward force acting on the solid 
Earth, as observed in our seismic analysis. A pres- 
sure decrease near the calving front would apply a 
net downward force on the glacier terminus, low- 
ering the glacier surface in a manner similar to 
that occurring twice each day when the ocean 
tides draw down the water level. At sensors ex- 
periencing earthquake deflections, we observed 
tidal variations in the glacier’s vertical position of 
-0.1 m per 1 m of tidal amplitude. The calving- 
related deflection of the glacier surface was -0.1 
to 0.16 m, suggesting a change in water pressure 



equivalent to a water-height change of -1 to 1.6 m, 
or roughly 1 to 2 x 10^ Pa. 

No observations of pressure or water-level var- 
iations are available from the region in the :Qord 
immediately in front of the glacier, where thick 
ice melange (Fig. 1) prohibits instrumentation. 
However, results from analog laboratory experi- 
ments allowed us to evaluate our inferences (22). 
A model glacier “terminus” was secured at one 
end of a water-filled tank, and plastic “icebergs” 
made from low-density polyethylene were placed 
flush against the terminus and allowed to capsize 
spontaneously under the influence of gravita- 
tional and buoyancy forces (24) (Fig. 3). Sensors 
embedded in the model glacier terminus moni- 
tored pressure in the water column and the force 
exerted on the terminus during iceberg capsize. 

The measured force on the terminus as the 
icebergs began to capsize was oriented in the 
upglacier direction and slowly increased as the ice- 
bergs rotated. As the icebergs neared horizon- 
tal, the force decreased rapidly. Pressure at the 
terminus decreased as the icebergs rotated, in- 
creasing again as the icebergs neared horizontal. 
Once the icebergs lost contact with the terminus, 
the measured force and pressure began to oscil- 
late as a result of induced wave action in the tank. 

We scaled up the measured forces and pres- 
sures to match the dimensions of icebergs calved 
at Helheim Glacier (Fig. 3). The laboratory data 
scale by powers of the ratio of the iceberg height 
in the field to the iceberg height in the laboratory 
(20, 24). The scaled peak force agreed well with 
typical values inferred from earthquake analysis 
(-10^^ N). The scaled peak pressure drop (-5 x 
10^ Pa) applied over an area corresponding to the 
iceberg’s map-view dimensions yielded an upward- 
directed force consistent with the seismically in- 
ferred vertical force component, such that the 
total force acting on the solid Earth was oriented 
-10° above horizontal. Computation and inver- 
sion of synthetic seismograms from the scaled 
force and pressure data confirmed the consistency 
of the laboratory model with real-world data. 

We used the scaled force and pressure to 
predict the deformation of the terminus region 
(22). The total force (F’tot) per unit area (Ap) act- 
ing on the calving region produces a horizontal, 
linear deflection orthogonal to the calving front, 
such that Ftot/AY = EAL/L, where E is the Young’s 
modulus of glacial ice. The value of L is chosen to 
provide the best match to the glacier position 
data. This length-scale probably represents the 
distance from the terminus to the grounding 
zone. We modeled the ungrounded section of the 
glacier as an elastic beam of length L loaded by 
the vertical force created by the pressure drop. 
The inferred distances L are a few kilometers, 
consistent with values estimated from GPS data. 

Glacier displacements predicted from the scaled 
laboratory data for iceberg dimensions corre- 
sponding to a calving event on DOY 206 (Fig. 1 and 
Fig. 2A) are shown in Fig. 3. Agreement with the 
observed glacier displacement was very good, 
particularly during the time over which the 
force acted in the upglacier direction (until the 
earthquake centroid time). After this time, the 
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laboratory-derived prediction was dominated by 
oscillations of the water column in the tank, which 
did not contain the thick layer of ice melange 
present in Helheim Fjord that would be expected 
to dampen such high-frequency oscillations. 



We conclude that as large icebergs rotate and 
accelerate away from the glacier calving front 
(Fig. 4), the reaction force— which is the horizontal 
component of the earthquake force— compresses 
the glacier front elastically, overcoming normal 



downglacier flow and temporarily reversing the 
motion of the glacier. Hydrodynamic interaction 
of the iceberg with the ^ord water rapidly reduces 
pressure behind the rotating iceberg, resulting in 
an upward force on the solid Earth that is the 

Fig. 1. Helheim Glacier, position of sensors, and 
seismic force directions. The location of GPS sen- 
sors and icebergs calved at Helheim Glacier (HH) for 
glacial earthquake events at 03:13 UTG on DOY 206 
2013 and 19:21 UTG on DOY 212 2013 are sup- 
erimposed on a Landsat 7 image from DOY 167 2013. 
“Affected” sensors exhibit earthquake-related deflec- 
tions. Scan-line-corrector failure stripes have been 
removed for clarity. Glacier flow is from left to right: 
bright white melange (a mix of iceberg fragments 
and sea ice) can be seen in front of the calving 
margin. Galving-front positions were obtained from 
photogrammetric digital elevation models derived 
from cameras. Positions are meters in Universal 
Transverse Mercator zone 24N. 







o 


GPS (unaffected) 


☆ 


GPS (affected) 


O 


Camera 


— 


Front position 
before calving 


— 


Front position 
after calving 


— ► 


Earthquake force 


Calved area 
Day 206, 03:13 
Day 212, 19:21 




532000 



536000 

Easting (m) 



540000 




Minutes from 



Fig. 2. Response of GPS sensors on glacier at 
the time of glacial earthquakes. (A) Sensor 1 at 
03:13 UTG on DOY 206 2013. (B) Sensor 9 at 19:21 
UTG on DOY 212 2013. Blue dots show detrended 
along-flow displacement: red dots show height. 
Shading shows lo position errors. Horizontal dis- 
placement has trends from 30 to 10 min before tc 
removed (A = 28.9 m/day, B = 24.6 m/day). 
Height has mean removed. (C and D) Plan view of 
GPS traces shown in (A) and (B) during the 30 min 
surrounding tc (marked as 0). 
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Fig. 3. Scaled laboratory data from glacier “terminus” during “iceberg” capsize event, compared 
with field observations. (A) Horizontal displacement scaled from force (black line) compared with 
downflow GPS data (blue). (B) Vertical displacement scaled from pressure (black line) compared with 
vertical GPS data (red). Errors in laboratory data are standard deviation calculated from repeated 
capsize events. GPS data shown are as in Fig. 2A. Photographs show stages of capsize at times marked 
by dashed lines and (solid gray line) fc- The aspect ratio of the model iceberg is 0.22. 



Fig. 4. Cartoon of glacier 
terminus during calving event. 

Glacier deflection caused by a 
capsizing iceberg is shown 
relative to the initial glacier 
position (dotted line). Acceleration 
of the iceberg to the right exerts a 
force in the upgiacier direction 
(left), leading to reverse motion of 
the GPS sensors (green star). 

Reduced pressure behind the 
iceberg (L) draws water from 
beneath the glacier and from the proglacial fjord, pulling the floating portion of the glacier downward and 
exerting an upward force on the solid Earth. 




vertical force observed in the earthquake. The 
lowered water pressure draws down the ungrounded 
glader margin, pulling the glacier surfece downward 
during the earthquake. 

Our results document the forces active during 
an increasingly important class of calving events 
and definitively identify the processes that cause 
glacial earthquakes. This understanding of gla- 
cier calving and glacial earthquakes opens the 
potential for remote quantitative characteriza- 
tion of iceberg calving and calving rates, as well 
as improved models for ice-ocean interaction. 
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PLANT SCIENCE 

Morphinan biosynthesis in opium 
poppy requires a P450-oxidoreductase 
fusion protein 

Thilo Winzer,^ Marcelo Kern/ Andrew J. King/ Tony R. Larson/ 

Roxana I. Teodor/ Samantha L. Donninger/ Yi Li/ Adam A. Dowle/ 

Jared Cartwright/ Rachel Bates/ David Ashford/ Jerry Thomas/ 
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Morphinan alkaloids from the opium poppy are used for pain relief. The direction of 
metabolites to morphinan biosynthesis requires isomerization of (S)- to (/?)-reticuline. 
Characterization of high-reticuline poppy mutants revealed a genetic locus, designated 
STORR [(S)- to (R)-reticuHne^ that encodes both cytochrome P450 and oxidoreductase 
modules, the latter belonging to the aldo-keto reductase family. Metabolite analysis of 
mutant alleles and heterologous expression demonstrate that the P450 module is 
responsible for the conversion of (S)-reticuline to 1,2-dehydroreticuline, whereas the 
oxidoreductase module converts 1,2-dehydroreticuline to (/?)-reticuline rather than 
functioning as a P450 redox partner. Proteomic analysis confirmed that these two 
modules are contained on a single polypeptide in vivo. This modular assembly implies 
a selection pressure favoring substrate channeling. The fusion protein STORR may 
enable microbial-based morphinan production. 



T he naturally occurring opiates of the 
morphinan subclass of benzylisoquinoline 
alkaloids (BIAs) include morphine, codeine, 
and thebaine. Morphine and codeine can 
be directly used as analgesic painkillers, 
and thebaine is widely used as a feedstock for 
the synthesis of a number of semisynthetic opi- 
ates, including hydrocodone, hydromorphone, 
oxycodone, and oxymorphone, as well as the 
opioid antagonist naloxone. The discovery and 
isolation of morphine from the opium poppy 
(Papaver somniferum L.) by Friedrich Sertlirner 
in 1806 (i) are a milestone in the history of 
pharmacy. More than 200 years later, opiate 
alkaloid-based pharmaceutical formulations 
remain the most potent treatment for severe 
pain, with sales totaling $US11.6 billion in 
2013 (2). 

BIAS are found in a number of species in the 
Papaveraceae family, but morphine production 
has only been reported in the opium poppy and 
the closely related P. setigerum (3). The morphinan 
backbone contains five asymmetric carbon cen- 
ters. Total chemical synthesis, although possible 
(4), is not an economically viable means of pro- 
duction. Consequently, morphinan alkaloids are 
still exclusively sourced from the opium poppy 
plant. Much effort has gone into the elucidation 
of the morphinan branch of BIA metabolism 
over the past 25 years, resulting in the identi- 
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fication of genes for all but the gateway step in- 
volving the epimerization of (S)- to (P)-reticuline 
{3, 5-14). Thus, although it has been possible to 
produce both {S)~ and racemic mixtures of {R,S)~ 
reticuline and morphinans in metabolically en- 
gineered microbial systems (15-18), the clean 
enzymatic conversion of (S)- to (P)-reticuline 
remains the goal. 

(S)-reticuline is the central intermediate of 
BIA metabolism (fig. SI), and conversion to 
its R epimer is believed to be a two-step pro- 
cess (19, 20). The S epimer is first oxidized to 
the quaternary positively charged amine 1,2- 
dehydroreticuline, followed by reduction to 
(i?)-reticuline (Fig. lA). Activities for each step 
have been reported, but the identity of the cor- 
responding proteins has not been established 
(19, 20). We have combined a candidate gene 
approach with genetic analyses of F 2 popula- 
tions of P. somniferum segregating for muta- 
tions that are deficient in (S)- to (R)-reticuline 
conversion and discovered that a fusion pro- 
tein is responsible for sequentially catalyzing 
both steps of the epimerization. 

RNA interference (RNAi) knockdown of co- 
deinone reductase in the opium poppy was re- 
ported to cause accumulation of (S')-reticuline, 
which is eight steps upstream of the codeinone 
reductase substrate (21), a result the authors at- 
tributed to metabolite channeling. We considered 
an alternative hypothesis to be the off-target co- 
silencing of a closely related oxidoreductase in- 
volved in the conversion of (S)- to (R)-reticuline. 
Using the sequence of the RNAi silencing con- 
struct to query an in-house expressed sequence 
tag (EST) library from stem and capsule tissue of 
opium poppies (22), we identified a contiguous 
assembly comprising a cytochrome P450 mono- 



oxygenase that is 3 -linked to an oxidoreductase. 
Sequencing cDNA clones from opium poppy stems 
confirmed the in-frame fusion transcript (Fig. lA 
and fig. S2). The P450 module was designated 
CYP82Y2. 

To investigate whether the corresponding 
gene is a candidate for one or both steps in the 
epimerization of (S)- to (R)-reticuline, we se- 
quenced corresponding cDNA clones from three 
independent mutants identified from an ethyl 
methanesulfonate-mutagenized population of a 
high-morphine cultivar, HM2 (23). All three mu- 
tants have lost the ability to produce morphinan 
alkaloids and instead accumulate high levels of 
(S)-reticuline as well as the (S)-reticuline-derived 
alkaloids laudanine and laudanosine (Fig. 1, B 
and D). We found that all three mutant lines 
carry mutations in the corresponding gene lo- 
cus (Fig. lA), which we name STORR [(S)- to 
(Rfreticuline']. The storr-1 allele carries a pre- 
mature stop codon corresponding to amino acid 
position W668 in the oxidoreductase module 
of the predicted fusion protein, storr-1 plants 
also contain low but significant levels of 1,2- 
dehydroreticuline (Fig. 1C), which suggests that 
the oxidoreductase module catalyzes the sec- 
ond step of the epimerization, the reduction of 
1,2-dehydroreticuline to (R)-reticuline. storr-2 
and storr-3 are both disrupted in the CYP82Y2 
module: storr-2 contains a premature stop at 
codon position W278, and storr-3 contains a 
missense mutation causing a glycine-to-arginine 
substitution at position 550 (Fig. lA). Dried 
capsules of storr-2 and storr-3 accumulate (S)- 
reticuline but not 1,2-dehydroreticuline, sug- 
gesting that the CYP82Y2 module is responsible 
for the first epimerization step, the oxidation 
of (S')-reticuline to 1,2-dehydroreticuline. Com- 
plementation tests and F 2 segregation analysis 
confirmed that a single genetic locus is respon- 
sible for the high-reticuline phenotype, associ- 
ation of the three recessive storr alleles with 
the high-reticuline phenotype, and the consec- 
utive roles of the CYP82Y2 and oxidoreductase 
modules in the epimerization of (S)- to (R)- 
reticuline (tables S2 and S3). 

To establish whether the STORR locus is not 
only transcribed but also translated as a fusion 
protein, we used a quantitative mass spectrom- 
etry approach after gel fractionation of crude 
protein extract from HM2. Peptides from across 
the entire STORR protein were found to be 
most abundant in the gel regions covering the 
100.65-kD predicted size of the fusion protein, 
confirming this as the in vivo form (Fig. 2). For 
direct functional characterization, the STORR 
fusion protein and the separate modules were 
expressed in Saccharomyces cerevisiae, and en- 
zyme assays were performed on soluble extracts 
and microsomal preparations (Fig. 3). We found 
that 1,2-dehydroreticuline is converted to (R)- 
reticuline with 100% conversion efficiency by both 
the STORR fusion protein and the oxidoreductase 
module, but not by the CYP82Y2 module plus its 
redox partner (Fig. 3A). In contrast, the CYP82Y2 
module plus its redox partner catalyzed 97% con- 
version of (S)-reticuline to 1,2-dehydroreticuline, 
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Fig. 1. Characterization of opium poppy mutants disrupted in A 
the conversion of (S)- to (/?)-reticuline. (A) Schematic showing 
epimerization of (S)- to (R)-reticuline and position of the storr-1, 
storr-2, and storr-3 mutations in the predicted fusion protein. (B) 

Mean ± SD capsule reticuline content in the HM2 wild-type 
cultivar and storr mutants (HM2, n = 5; storr-1, n = 12; storr-2, n = 17; 
storr-3, n = 15). DW, dry weight. Reticuline content was verified as 
>99.2% (S)-reticuline in all mutants by chiral high-performance 
liquid chromatography (HPLC) (table SI). (C) 1,2-dehydroreticuline. 

(D) All compounds >1% total alkaloids (n = 10) are individually 
identified, with minor peaks (n = 379), grouped as “Other.” 




fS)-Reticuline 1,2-Dehydroreticuline fH)-Reticuline 



storr-2 storr-3 storr-1 

834 G/A 1 648 G/A 2004 G/A 

W278* G550R W668* 

I I 



CYP82Y2 module Oxidoreductase module 



300 cDNA nucleotides / 100 amino acids 




■ Reticuline 

■ Laudanine 

■ Morphine 

■ Laudanosine 



Codeine ■ 

Codeine-thebaine dimer ■ 
1 ,2-Dehydroreticuline ■ 

Thebaine 



Unknown (M310.1435T176) 
Unknown (M328.1538T85) 
Other 



Fig. 2. Size determination of STORR pro- 
tein in the opium poppy. Protein extracts 
from three stem samples of HM2 wild type 
were fractionated, together with size markers, 
by SDS-polyacrylamide gel electrophoresis 
(SDS-PAGE), and the three lanes were each 
cut into 15 fractions (see horizontal lines on 
one representative lane) to resolve the pre- 
dicted fusion protein and putative individual 
0YP450 and oxidoreductase modules (101, 
65, and 36 kD, respectively). For relative 
quantification, an equal amount of a tryptic 
digest of ^^N-labeled recombinant STORR 
protein was spiked into the in-gel digest of 
each SDS-PAGE fraction before HPLG mass 
spectrometry (HPLG-MS). Ratios of peak 
areas from extracted-ion chromatograms of 
light (endogenous) to heavy (labeled) versions 
of five peptides from across the STORR 
protein sequence were compared. Ratios of 
normalized peak areas from extracted-ion 
chromatograms were converted to binary 
logarithms for the calculation of means and 
standard errors of the mean. Only measure- 
ments where the respective peptides were 
found in all three biological replicates are 
shown. 




■ YGPIFSFPTGSHR 
(pos. 90-102) 

■ IVCGFQSGPK 
(pos. 234-243) 

■ LYPASPVVER 
(pos. 433-442) 

■ IKPCVQSAASER 
(pos. 568-579) 

IGEIPQCR 
(pos. 869-876) 
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Retention time (mins) 

Fig. 3. Functional characterization of the STORR fusion protein by 
heterologous expression in S. cerevislae. (A) HPLC-MS analysis of the 
in vitro conversion of 1,2-dehydroreticuline to (P)-reticuline. Crude soluble 
(sol) or microsomal (ms) preparations harboring the empty pESC-TRP 
vector (green), vector containing the oxidoreductase module (purple), an 
opium poppy cytochrome P450 reductase (CPR) redox partner (black), CPR 
+ CYP82Y2 (blue), or CPR + the CYP82Y2-oxidoreductase fusion (red) were 
assayed (21). The solid lines of the HPLC-MS chromatograms show the 
normalized total ion count at a mass/charge ratio (m/z) of 328, corre- 
sponding to 1,2-dehydroreticuline (substrate), whereas the dotted lines 
show the normalized total ion count at m/z 330, corresponding to reticuline 
(product). The inset panel shows the chiral analysis of reticuline: The gray 




Retention time (mins) 

trace is for an (S)- and (R)-reticuline standard, and the purple and red traces 
correspond to reticuline derived by activity of the oxidoreductase and 
CYP82Y2-oxidoreductase fusion, respectively. (B) HPLC-MS analysis of the 
conversion of (S)-reticuline into 1,2-dehydroreticuline and (R)-reticuline. 
Crude microsomal preparations obtained from S. cerevisiae harboring 
expression vector pESC-TRP containing CPR only (black), CPR -t- a CYP82Y2 
module (blue), or CPR -r CYP82Y2-oxidoreductase fusion (red) were 
assayed (21). The solid lines of the HPLC-MS chromatograms show the 
normalized total ion count at m/z 328, corresponding to 1,2-dehydroreticu- 
line, and the dotted lines show the normalized total ion count at m/z 330, 
corresponding to reticuline. The inset panel shows the chiral analysis of 
reticuline, with the same line colors as in the main panel. 



demonstrating that it acts as a 1,2-dehydro- 
reticuline synthase (Fig. 3B). Microsomal prep- 
arations harboring the entire STORR fusion 
protein converted about 20% of the added 
(S)-reticuline to (R)-reticuline, confirming the 
bifunctional role of the protein in performing 
sequential reactions in the epimerization of 
reticuline. Kinetic analysis revealed that the 
microsomal CYP82Y2 module alone and the 
CYP82Y2-oxidoreductase fusion had similar 
Michaelis constant values of 13 and 14 pM for 
(S')-reticuline and 1,2-dehydroreticuline, respec- 
tively (fig. S3). Consistent with the plant mutant 
phenotypes, we found that microsomally ex- 
pressed STORR carrying the storr-2 mutation 
lacks both the P450 and oxidoreductase activ- 
ities, the storr-3 mutation lacks the P450 acti- 
vity but does still exhibit the oxidoreductase 
activity, and the storr-1 mutation has lost the 
oxidoreductase activity but maintains very low 
levels of P450 activity (fig. S4). 

P450-redox systems where the P450 enzyme 
is covalently linked to redox partner reductase 
components are well known in both prokaryotes 
and lower eukaryotes (24). In the STORR fusion 
protein, the P450 module is linked to a reduc- 
tase, but rather than functioning as a redox 
partner for the P450, this reductase catalyzes 
the product of the P450 to complete a two-step 
epimerization of (S)- to (R)-reticuline. Other forms 
of bifunctional P450 fusions with oxygenase/ 
peroxidase, hydrolase, and dioxygenase modules 
have been reported to occur in ascomycetes, and 
all of these also appear to catalyze sequential 



reactions (25-27). A possible explanation as to 
why such fusion proteins evolve is that they fa- 
cilitate efficient channeling of highly unstable 
or reactive intermediates. Evidence for efficient 
substrate channeling in the case of the STORR 
fusion protein comes from the observation that 
microsomal fractions harboring the fusion pro- 
tein directly convert (S)- to (R)-reticuline, with 
no detectable accumulation of 1,2-dehydroreticu- 
line (Fig. 3B). 

Phylogenetic analysis suggests that the STORR 
fusion occurred after the split of the CYP82X 
and the CYP82Y subfamilies and that the oxi- 
doreductase module falls into subfamily 4 of 
the aldo-keto reductases, with closest homol- 
ogy to the codeinone reductase family from 
P. somniferum (fig. S5). A query of the IK plant 
transcriptome resource (28) identified similar 
predicted module arrangements in EST collec- 
tions from two other morphinan-producing 
Papaver species, P. hracteatum and P. setigerum, 
but not in P. rhoeas, which does not make mor- 
phinans (table S4). We hypothesize that the 
STORR fusion was the key step in the evolution 
of the morphinan branch of BIA metabolism. 
After this step, other enzymes were recruited 
and adapted, including dioxygenases and re- 
ductases, ultimately giving rise to codeine and 
morphine in P. somniferum and P. setigerum 
(29). Thus, this morphinan biosynthetic path- 
way, and probably other plant secondary meta- 
bolic pathways, depends on the organization 
of both individual gene structure and genome 
rearrangement (22). 
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CIRCADIAN RHYTHMS 

Atomic-scale origiiis of slowness in 
the cyanobacterial circadian clock 

Jun Abe/* Takuya B. Hiyama/* Atsushi Mukaiyama/’^* Seyoung Son/*f 

Toshiiumi Mori/’^* Shinji Saito/’^’^ Masato Osako/ Julie Wolanin/’^ Eiki Yamashita/ 

Takao Kondo/ Shuji Akiyama^’^t 

Circadian clocks generate slow and ordered cellular dynamics but consist of fast-moving 
bio-macromolecules; consequently, the origins of the overall slowness remain unclear. 

We identified the adenosine triphosphate (ATP) catalytic region [adenosine triphosphatase 
(ATPase)] in the amino-terminal half of the clock protein KaiC as the minimal pacemaker 
that controls the in vivo frequency of the cyanobacterial clock. Crystal structures of the 
ATPase revealed that the slowness of this ATPase arises from sequestration of a lytic water 
molecule in an unfavorable position and coupling of ATP hydrolysis to a peptide isomerization 
with high activation energy. The slow ATPase is coupled with another ATPase catalyzing 
autodephosphorylation in the carboxyl-terminal half of KaiC, yielding the circadian 
response frequency of intermolecular interactions with other clock-related proteins that 
influences the transcription and translation cycle. 



C ircadian clocks comprise suites of biological 
processes that oscillate with a 24-hour pe- 
riod (i). Clock genes and clock proteins 
are present in prokaryotes and eukaryotes 
(2, 3); together, they constitute feedback 
loops that effect transcriptional and transla- 
tional oscillations (TTOs). The origin of the slow 
circadian time scale is thought to be the time 
delay between clock gene transcription and feed- 
back signals that regulate it; however, the tran- 
scriptional and translational events can occur 
quickly (i.e., within minutes) (4). Posttranslational 
oscillations (PTOs) (5-7) in biochemical modi- 
fications of clock proteins occur even without 
transcriptional and translational regulation. 
Proteins generally exhibit dynamics within pico- 
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seconds or seconds (8), much faster than the 
circadian time scale. Thus, both TTO and PTO 
circadian systems are assembled from building 
blocks with intrinsically fast dynamics, raising 
questions about how and why the systems are 
so slow and stable overall (9). 

The cyanobacterium Syneckococcus elongatus 
PCC7942 is the simplest organism known to have 
both TTOs (JO) and PTOs (5). The S. elongatus 
PTOs can be reconstructed in vitro by incubat- 
ing a core clock protein, KaiC, with two other 
clock proteins, KaiA and KaiB, and adenosine 
triphosphate (ATP) (5). The rhythmic behaviors 
of the Kai oscillator have been confirmed in 
many functional and structural analyses, which 
have probed the ATP hydrolysis [adenosine tri- 
phosphatase (ATPase)] activity of KaiC {11, 12), 
autophosphorylation and autodephosphoryla- 
tion activities of KaiC {6, 13, 14), conformational 
transitions of the proteins {12, 15, 16), and as- 
sembly or disassembly of Kai complexes {17-20). 
Because the PTO period in S. elongatus is firmly 
correlated to the TTO period during the day 
(5, 6), the in vitro Kai oscillator should enable 
us to identify the mechanisms underlying the 
slowness of the circadian clock. 

We searched the Kai oscillator for a minimal 
slow reaction whose efficiency correlated with 
in vivo TTO frequency (Fig. 1) (see supplementary 
materials and methods). The ATPase activity 
of full-length wild-type KaiC (KaiC-WT), con- 



sisting of the N-terminal Cl and C-terminal C2 
domains, has been proposed as the basic timing 
cue {11, 12). We identified the steady-state ATPase 
activity of the Cl domain (Cl-ATPase) as a suit- 
able slow reaction. A truncated version of KaiC 
consisting solely of the N-terminal domain, KaiCl- 
WT, exhibited much slower ATP hydrolysis (11 + 
1 ATP per day per KaiC at 30°C) than well-known 
motor proteins (10^ to 10^ day"^) such as myosin, 
kinesin, and F^-ATPase (table SI). We confirmed 
the correlation of this activity with the in vivo TTO 
frequency using a series of period-modulating 
KaiC mutations in the Cl domain 
(S157P), S157C, T42S, S48T] {21) (Fig. ID), in which 
higher steady-state Cl-ATPase in vitro resulted 
in higher-frequency TTOs in vivo. Thus, KaiC 
should experience a certain number of hydrol- 
ysis events per cycle in vivo, and the absolute 
rate of ATPase activity is connected to the cel- 
lular clock’s overall slowness {11, 22). Therefore, 
we examined the structural origin of the slow 
Cl-ATPase and its coupling with TTOs via spatio- 
temporally distinct events, including the intra- 
molecular KaiC ATPase and its phosphorylation 
cycles, as well as intermolecular interactions with 
KaiA and KaiB (Fig. 1). 

To this end, we crystallized KaiCl-WT and five 
period-modulating variants in the pre- or posthy- 
drolysis states, or both. All resultant crystals were 
in the P2i2i2i space group (Fig. 2, A and B, fig. SI, 
and table S2). The prehydrolysis states (Fig. 2A, 
blue subunits) exhibited common features: assem- 
bly of six subunits into a hexamer and incorpo- 
ration of one molecule of the slowly hydrolyzed 
ATP analog adenosine 5'-(y-thiotriphosphate) 
(ATP-y-S) into every subunit-subunit interface. 
ATP^-S existed in a complex with a Mg^^ ion (Mg- 
ATP-y-S) in the ordinaiy octahedral-coordination 
geometry (fig. S2A). We observed the posthydrol- 
ysis state of the long-period variant KaiCl-S48T 
(Fig. 2B, orange and green subunits), which crys- 
tallized as an asymmetrically ATP- or adenosine 
diphosphate (ADP)-bound hexamer (fig. S2B). 
The ring-shaped hexamer was deformed asym- 
metrically due to steric constraints resulting from 
close juxtaposition of three types of subunits, creat- 
ing both tight and loose intersubunit interfaces 
(Fig. 2B, fig. S3, and supplementary text). 

We identified two structural sources of slow 
ATPase activity. The first is the regulatory influ- 
ence of the protein moiety on a lytic water mol- 
ecule (Wl) near the phosphorus atom (P^) of the 
y-phosphate group of an ATP. In the prehydrol- 
ysis state (blue dotted box in Fig. 2C), Wl was 
sequestered (with a Wl-P.^, distance of 3.8 to 
3.9 A and a Wl-P^-Osp angle of 154° to 158°) by 
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H-bonding to the F199 carbonyl oxygen, the 
nitrogen atom of R226 side chain (N^), and 
another water molecule (W2). This position of W1 
was much farther from a near-in-line configu- 
ration, with respect to the Py-Osp bond (3.0 A, 
180°), than analogous water molecules in other 
motor proteins that hydrolyze ATP efficiently— 
e.g., kinesin (3.3 A, 164° to 167°) (23), myosin 
(2.3 A, 170°) (24), and Fi-ATPase (3.1 to 3.8 A, 
168° to 172°) (25) (Fig. 2D and table SI). Molecu- 
lar dynamics simulation suggested that W1 
was only marginally stabilized. Although a water 
molecule might occasionally migrate into and 
out of the W1 position, the near-in-line position 
was completely inaccessible due to steric hin- 
drance from the F199 carbonyl oxygen and the 
R226 (Fig. 2E and movies SI to S3). Thus, 
the sequestration of W1 from the near-in-line 
position is one atomic-scale origin of slow hy- 
drolysis. In another crystal form of KaiCl-WT 
(P3i21 space group; ocher subunits in Fig. 2F and 
fig. SIA), W1 was even farther from the P^ of the 
y-phosphate group, which should be even more 
unfavorable to hydrolysis than the W1 position 
that we observed in the P2i2i2i space group. In 
yet another crystal form, more or less unfavorable 



positioning of W1 occurred asymmetrically within 
the hexamer (fig. SIB). On the basis of nine struc- 
tures (Fig. 2F, fig. SI, and supplementary text), 
we identified S157P and Q153A as mutations that 
indirectly stabilized the less unfavorable position- 
ing of W1 (blue subunit), resulting in higher 
Cl-ATPase activity (Fig. IJ). By contrast, S157C 
stabilized the more unfavorable positioning of 
W1 (ocher subunit) through N-terminal fray of 
a7 (fig. S4) and repositioning of a6, resulting in 
lower Cl-ATPase activity (Fig. IJ). 

The second origin of slowness involves cou- 
pling of ATP hydrolysis to slow cis-trans isomeriza- 
tion of the peptide chain. In the prehydrolysis 
state, the peptide bond between D145 and S146 
(D145 s 146 mainly adopted the cis confor- 

mation (Fig. 3A). By contrast, in the posthydroly- 
sis state, the peptide was entirely in the 

trans conformation (Fig. 3A and fig. S5A). This 
trans selectivity was achieved through hydrolysis 
of ATP bound in the counterclockwise (CCW) 
interface (Fig. 2C, dotted boxes: blue ^ orange ^ 
green; see also supplementary text), along with 
repositioning of helices a6 and a7 on the clock- 
wise (CW) side (Fig. 3A). These structural obser- 
vations support the idea that hydrolysis of ATP 



bound in the CCW interface results in a confor- 
mational change that forces the peptide 

to adopt the trans conformation. 

According to our computational results, the 
prehydrolysis state containing the cis-D^^^S^'^^ 
peptide must overcome a barrier of 14 to 16 kcal 
mor^ (Fig. 3B) for cis-trans isomerization and 
a barrier of 11 to 17 kcal moF^ (table SI) to 
disrupt the Py-Osp bond. These two events are 
potentially related to each other through re- 
positioning of helices a6 to a8 (fig. S5G and 
supplementary text). Given a frequency factor of 
~10^^ s~\ the rate of a reaction that must over- 
come a barrier of ~22 kcal moF^ is -0.5 hours"^ 
(-12 ATP day“^) at 30°C. The impact of cis-trans 
flipping was confirmed using the KaiCl-S146P 
mutant, in which the peptide is forced 

to adopt the cis conformation (fig. S5B). More- 
over, full-length KaiC-S146P exhibited a nota- 
ble decrease in steady-state ATPase activity 
(4.8 ATP per day per KaiC), as well as a nearly 
50% increase in the phosphorylation cycle pe- 
riod (34 hours). Thus, cis-trans flipping of the 
j)i45gi46 pgpti(jg could impose a substantial en- 
ergy barrier on the hydrolysis-coupled transition 
of the subunits. 
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Fig. 1. trans-hierarchical nature of the cya- 
nobacterial circadian clock. (A to J) Scatter- 
plot matrix showing all pairwise correlations 
among the frequency of the in vivo TTO cycle 
(10), frequency of the in vitro phosphorylation 
cycle of the Kai oscillator, undamped natural 
frequency (cop) of KaiC alone, steady-state 
ATPase in the full-length KaiC-WT background, 
and steady-state Cl-ATPase in the KaiCl-WT 
background, r, correlation coefficient. 
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In full-length KaiC, the slow Cl-ATPase is in- 
tegrated with another ATPase in the C2 domain 
(C2-ATPase), forming a coupled Cl-C2-ATPase 
system. At steady state, the Cl-ATPase predomi- 
nated over the C2-ATPase (Fig. U). The pacemak- 
ing role of Cl-ATPase became obvious during the 
approach to the steady state (pre-steady state). 
Immediately after the addition of excess ATP to 
full-length apo-KaiC protein (26), nascent KaiC- 
WT hydrolyzed 35 ATP per day per KaiC at 30°C 
(Fig. 4A and fig. S6A). The activity suddenly de- 
creased to ~8 ATP per day per KaiC during the 
first 6 hours and then stably recovered to and 
remained at 13 + 1 ATP per day per KaiC, the ac- 
tivity required for steady-state KaiC-WT (11). The 
biphasic-exponential phases reflected the pre- 
steady-state relaxation of the coupled C1-C2- 
ATPase (supplementaiy text). Minimal dependen- 
cies on ATP, ADP, and KaiC protein concentrations 
(fig. S6B) fiirther supported the hydrolysis-related 
maturation of ATP- or ADP-bound KaiC-WT hex- 
amer (supplementary text). The phosphorylation 



state of the C2 domain was only weakly associated 
with the slow process, as the phosphorylation- 
mimic KaiC-S431D/T432E exhibited detectable 
relaxation (fig. S6C). 

Response speed of the Cl-C2-ATPase is con- 
trolled by the Cl-ATPase. Long-period Cl mutants 
(T42S, S157C, S48T, and A251V) with suppressed 
Cl-ATPase activity exhibited slower relaxations 
of the Cl-C2-ATPase than KaiC-WT, whereas short- 
period Cl mutants (Q153A and S157P) with elevated 
Cl-ATPase activity exhibited more rapid relaxation 
with a deeper undershoot (Fig. 4B). These obser- 
vations suggested that steady-state Cl-ATPase 
activity governs the speed of pre -steady-state 
relaxation of the Cl-C2-ATPase and oscillation 
period in vitro (Fig. IG) and in vivo (Fig. ID). The 
undamped natural frequency (coj is another mea- 
sure of the response speed of the coupled C1-C2- 
ATPase, as determined by fitting each transient 
curve of ATPase activity (Fig. 4B) to the initial- 
state response curve of the quasi-second-order 
system (supplementary text). Furthermore, the cOn 



value of KaiC-WT alone in vitro was 0.91 + 0.01 
day"^, matching the circadian oscillatory fre- 
quency. We confirmed fine correlations among 
steady-state Cl-ATPase activity, cOn value, and 
in vivo frequency of TTOs for a series of the 
period-modulating Cl mutations (Fig. 1, B, D, 
and I). Thus, the slow Cl-ATPase contributes 
to the circadian pacemaker for both the C1-C2- 
ATPase system and TTOs. 

Slow relaxation of the Cl-C2-ATPase was re- 
lated to emergence of temperature compensa- 
tion in full-length KaiC. Early ATPase activity 
(<2 hours in Fig. 4C) decayed in a temperature- 
dependent manner (thermal sensitivity = 1.4) 
(inset of Fig. 4C), whereas steady-state activity 
after the later decay (>24 hours) was almost in- 
dependent of temperature (Q{^“ ~ 1.1). Because 
the contribution of C2-ATPase was negligibly 
small at steady state (Fig. IJ), the activity that 
did not vary with temperature can be attributed 
to the Cl-ATPase. Thus, steady-state Cl-ATPase 
activity may be kept constant in each hexamer 
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Fig. 2. Reaction cycle of the Cl-ATPase, a slow but stable ATPase that serves as the basic timing 
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other crystai structures, refer to detaiis in fig. SI. (B) Schematic overview of the crystai structure of 
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box) of Fig. 2B, and posthydroiysis C-D interface (green dotted box) of Fig. 2B. The mesh indicates the 
Fobs - Fca\c omit map of a potentiai iytic water moiecuie (Wl) and another water moiecuie (W2), contoured 
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by adjusting the ratio of prehydrolysis blue and 
ocher subunits and by scrambling their spatial 
arrangement (Fig. 2A and fig. SI, A and B). In 
this respect, oscillation of ATPase activity (Fig. 
4A) is realized by perturbing the intramolecular 



homeostasis of KaiC ATPase via intermolecular 
interactions with KaiA and KaiB. 

The Cl-C2-ATPase is coupled indirectly to in- 
teractions with KaiA and KaiB. Binding af- 
finities of KaiA and KaiB for pre-steady-state 



KaiC-WT were modulated in a biphasic and par- 
abolic manner on the same time scale as relax- 
ation of the Cl-C2-ATPase (fig. S7, A and B). 
However, such time evolution was lost in KaiC- 
S431D/T432E (fig. S7, A and B), which retained 
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Fig. 4 . Circadian periodicity determined by KaiC 
ATPase activity. (A) Damped oscillation observed 
for pre-steady-state relaxation of KaiC ATPase as 
the source of the circadian time scale at 30°C. In the 
presence of both KaiA and KaiB, the ATPase activity of 
KaiC-WT exhibits a stable oscillation around its steady- 
state ATPase activity. The bar at lower left indicates the 
maximal contribution from ATP synthesis during the first 
hour (27). (B) Dynamic responses of the ATPase activity 
observed for period mutants of full-length KaiC. Each 
curve is offset longitudinally for clarity of presentation. 
The values in parentheses represent the periods of 
in vitro phosphorylation rhythms. (C) Temperature de- 
pendency of pre-steady-state dynamics of ATPase ac- 
tivity in KaiC-WT. The insets represent Arrhenius plots. 
Errors in values are derived from a linear regression analy- 
sis. EA represents the apparent activation energy. 



SCIENCE sciencemag.org 



17 JULY 2015 • VOL 349 ISSUE 6245 315 



RESEARCH \ REPORTS 



pre-steady-state relaxation of Cl/C2-ATPase (fig. 
S6C). Thus, Cl-C2-ATPase status is coupled to 
intermolecular interactions with KaiA and KaiB 
through a phosphorylation-dependent confor- 
mational change of KaiC. This idea is also sup- 
ported by the fact that the C2-ATPase is essential 
for the complete autodephosphorylation (27). 
Thus, absolute slowness encoded in the Cl-ATPase 
is transferred to and correlated with the TTO cy- 
cle (Fig. 1) through a pathway in which non-steady- 
state relaxation of Cl-C2-ATPase leads to cycles 
of phosphorylation that alter intermolecular in- 
teractions with KaiA or KaiB or other clock- 
related proteins (28, 29). 

In cyanobacteria, slow homeostatic regulation 
of the coupled Cl-C2-ATPase is critical for cir- 
cadian periodicity (Figs. 1 and 4A) and also has 
an important role in entrainment of individual 
KaiC molecules by external stimuli such as tem- 
perature changes (30). Our results suggest how 
ancient cyanobacteria have incorporated Earth’s 
rotation period into their molecular systems. 
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INFLAMMATION 

Neutrophil extracellular traps license 
macrophages for cytokine production 
in atherosclerosis 



Annika Warnatsch, Marianna loannou, Qian Wang, Venizelos Papayannopoulos* 

Secretion of the cytokine interleukin-lp (ILTp) by macrophages, a major driver of pathogenesis 
in atherosclerosis, requires two steps: Priming signals promote transcription of immature 
ILTp, and then endogenous “danger” signals activate innate immune signaling complexes 
called inflammasomes to process IL-lp for secretion. Although cholesterol crystals are 
known to act as danger signals in atherosclerosis, what primes ILTp transcription remains 
elusive. Using a murine model of atherosclerosis, we found that cholesterol crystals acted 
both as priming and danger signals for ILTp production. Cholesterol crystals triggered 
neutrophils to release neutrophil extracellular traps (NETs). NETs primed macrophages for 
cytokine release, activating T helper 17 (Th17) cells that amplify immune cell recruitment in 
atherosclerotic plaques. Therefore, danger signals may drive sterile inflammation, such as that 
seen in atherosclerosis, through their interactions with neutrophils. 



I nfiammation is critical against infection but 
must be regulated by multiple checkpoints 
to prevent inflammatory disease (1). During 
infection, cytokine transcription is triggered 
by microbial molecules that activate pattern 
recognition receptors (2). Release of mature active 
cytokines requires additional “danger” signals as- 
sociated with host cell damage. Known as danger- 
associated molecular patterns (DAMPs), these 
secondary signals activate NLRP3 and other in- 
flammasomes, promoting cleavage and activation 
of the protease caspase-1 that processes cytokines 
such as interleukin-ip (IL-ip) into their mature 
form (3). 

IL-ip plays a critical role in the development of 
atherosclerosis and other inflammatory diseases. 
Because of its low solubility, cholesterol crystallizes 
in circulation and is taken up by monocyte-derived 
macrophages (4-6), activating their inflamma- 
somes to release IL-ip and other proinflamma- 
toiy cytokines (7). These molecules recruit myeloid 
cells to the vascular endothelium, where their 
cholesterol content generates obstructive lesions 
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(8). In atherosclerosis and other sterile inflam- 
matory diseases, the endogenous priming signals 
that activate IL-ip transcription prior to inflam- 
masome activation remain unknown. 

IL-ip up-regulates chemokines that recruit 
neutrophils to atherosclerotic lesions (9-11). Neu- 
trophils are implicated in disease (12, 13), but their 
role in pathogenesis remains poorly understood 
To combat pathogens that evade phagocytosis (14), 
neutrophils release neutrophil extracellular traps 
(NETs) composed of decondensed chromatin and 
antimicrobials (75). NETs are implicated in several 
inflammatory diseases (16), but their pathogenic 
mechanism and role in atherosclerosis are unclear. 

To examine how neutrophils respond during 
atherosclerosis, we investigated the effect of cho- 
lesterol crystals on human blood-derived neutro- 
phils. Cholesterol crystals induced NET formation 
(NETosis) (Fig. 1, A and B) at concentrations re- 
quired to activate the inflammasome (fig. SI, A 
and B) (7) as efficiently as microbes (14), trigger- 
ing a reactive oxygen species (ROS) burst (fig. SIC) 
and neutrophil elastase (NE) translocation to the 
nucleus (fig. SID), a critical step for NETosis (17). 
NETosis depended on ROS, as it was blocked 
by diphenylene iodonium [DPI; an inhibitor of 
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reduced nicotinamide adenine dinucleotide phos- 
phate (NADPH) oxidase] or an inhibitor (NEi) 
of the neutrophil-specific proteases NE and 
proteinase 3 (PR3) (i7) but not by Cl-amidine, 
which inhibits peptidylarginine deiminase (PAD) 
enzymes implicated in NETosis (18) (Fig. 1, A 
and B, and fig. SIE). Consistently, DPI or NEi 
blocked NE translocation to the nucleus driven 
by cholesterol (fig. SID) (19). 

Next, we examined whether NETs form during 
atherosclerosis. Previous studies reported the pres- 
ence of NETS in lesions but showed intact neu- 
trophils with condensed nuclei (20) or luminar 
rather than lesion-associated neutrophils in the 
absence of specific NET markers (21). We de- 
tected NETS as large amorphous extracellular 
structures in atherosclerotic lesions from apo- 
lipoprotein E (ApoE)-deficient mice that were 
placed on a high-fat diet (HFD) for 8 weeks to 
induce hypercholesterolemia (Fig. 1C). NETs formed 
in cholesterol-rich areas but were absent from 
adjacent adventitia (fig. S2A). To block NETosis 
in vivo, we crossed ApoE-deficient animals with 
mice deficient in PR3 and NE, because deleting 
both enzymes may abrogate NETosis more effec- 

A 



tively (22). NETs were completely absent in le- 
sions of ApoE/PR3/NE-deficient mice after 8 weeks 
on HFD (Fig. 1C) and ApoE-deficient mice after 
6 weeks on HFD receiving deoxyribonuclease 
(DNase), which degrades NETs (23) (fig. S2B). 

Subsequently, we assessed the effect of NET 
deficiency on atherosclerosis. When placed on 
HFD, ApoE- and ApoE/PR3/NE -deficient mice ex- 
hibited similar weight gain (fig. S3A) and blood 
cholesterol, triglyceride, and low-density lipo- 
protein (LDL) concentrations (fig. S3B). Analysis 
of aortic root cross sections showed that the two 
groups were modestly different after 4 weeks on 
HFD (fig. S3, C and D), suggesting that NETs did 
not play a critical role early during atherogenesis. 
However, after 8 weeks on HFD, ApoE/PR3/NE- 
deficient mice exhibited a factor of 3 reduction in 
plaque size relative to ApoE-deficient controls 
(Fig. 2, A and B). These differences were also 
reflected by en face analysis of intact aortas (fig. 
S3, E and F). DNase injections into ApoE-deficient 
mice on HFD for 6 weeks resulted in a comparable 
factor of 3 reduction in lesion size, which excludes 
the possibility that the proteases played NET- 
independent roles (Fig. 2, C and D). Lesion growth 



was unaffected by DNase in i^oE/PR3/NE-deficient 
mice that lack NETs. 

NET-deficient mice exhibited a reduction in 
lesion growth that was comparable to mice lacking 
NLRP3 or the IL-1 receptor; this finding suggested 
that NETs may drive atherosclerosis by modulating 
cytokine production. Indeed, IL-la, IL-ip, and IL-6 
were elevated in the plasma of ApoE-deficient ani- 
mals after 8 weeks on HFD but were largely absent 
in ApoE/PR3/NE-deficient mice (Fig. 3A and fig. 
S4A). After 16 weeks on HFD, IL-la but not IL- 
ip concentrations were still elevated in ApoE- 
deficient controls relative to NET-deficient animals 
(fig. S4B). DNase administration abrogated plasma 
cytokine concentrations in ApoE-deficient controls 
but had no effect on ApoE/PR3/NE-deficient mice 
(fig. S4C). Furthermore, IL-ip staining, which de- 
tects both immature and mature protein, was 
prominent in lesions from ApoE-deficient mice 
but was absent in ApoE/PR3/NE knockout ani- 
mals and colocalized with NETs and macrophages 
(Fig. 3B and fig. S4D). By contrast, IL-ip concen- 
trations were similar in the spleen or in blood 
mononuclear cells (fig. S4E), although we cannot 
exclude that systemic cytokines were not produced 
differentially elsewhere. In addition, IL-ip mRNA 
concentrations were significantly reduced in aortas 
from i^E/PR3/NE-deficient mice (Fig. 3C). To- 
gether, these data indicate that NE and PR3 were 
not mediating IL-ip maturation posttranslation- 
ally, and instead suggest that NETs are essential 
for the transcription of proinflammatoiy cytokines. 

The requirement of NETs for cytokine produc- 
tion, and the proximity of NETs to macrophages 
(Fig. 3D) and IL-ip in lesions (Fig. 3B), prompted 
us to examine whether NETs regulate cytokine 
production by macrophages. We prepared NETs 
from cholesterol crystal-stimulated neutrophils 
(fig. S5A) (24) and investigated their effects on 
CD14-purified, blood-derived human monocytes 
in vitro. Stimulation with NETs or cholesterol 
crystals separately yielded minor increases in 
IL-ip and IL-6 concentrations in culture super- 
natants (Fig. 3E and fig. S5B). In contrast, mono- 
cytes released substantial cytokine concentrations 
when pretreated with NETs and subsequently 
stimulated with cholesterol crystals (Fig. 3E). By 
comparison, neutrophils were not a major source 
of cytokines, as they released negligible concen- 
trations in response to cholesterol crystals (fig. 
S5C). Degradation of NETs by DNase treatment 
(fig. S4A) abrogated cytokine release (fig. S5B), 
indicating the requirement for a DNA moiety. 
Consistently, an oligonucleotide (ODN) antago- 
nist of the pattern recognition DNA receptor Toll- 
like receptor 9 (TLR9) significantly reduced cytokine 
release in NET-treated monocytes, but not mono- 
cytes primed with bacterial lipopolysaccharide 
(LPS) (Fig. 3E). Because TLR9 is not expressed 
in monocytes, these data suggest that DNA is 
important for monocyte activation but that its 
detection is mediated via other DNA receptors 
blocked by ODN. Blocking with oligonucleotide 
did not fully inhibit IL-ip induction, so we rea- 
soned that additional non-DNA NET factors con- 
tribute to monocyte activation. Blocking TLR2 
and TLR4, which bind endogenous proteins. 
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Fig. 1. Cholesterol ctystals trigger NETosis. (A) Fluorescence micrograph of neutrophils incubated 
with cholesterol crystals and stained with the lipid dye Dil (magenta) and extracellular DNA (Sytox, 
cyan). Neutrophils were left untreated or treated with NE inhibitor (NEi) or the NADPH oxidase inhibitor 
DPI. Scale bars, 100 iim. (B) Quantitation of NETosis in (A). Data are representative of three indepen- 
dent experiments. (C) Representative confocal immunofluorescence microscopy images of aortic root 
sections from ApoE~^~ and ApoE/PR3/NE~^~ mice on HFD for 8 weeks and stained for MPO (cyan), 
citrullinated histone 3 (Cit-H3, yellow), chromatin (magenta), and DNA (DAPI, blue). Borders between 
the adventitia (A) and the lesion (dotted line) and lumen (L) are shown; scale bars, 50 |im. The third 
row shows detail from the first row (arrow): scale bars, 20 |im. Data are representative of eight mice 
analyzed per strain from two independent experiments. 
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Fig. 2. NETS promote atherosclerosis. (A) Two representative microscopy 
images of aortic root sections from ApoE~'^~ and ApoE/PR3/NET^~ mice on 
HFD for 8 weeks and stained for lipid (Oil Red 0, red) and hematoxylin. Scale 
bars, 200 iim. (B) Quantitation of plaque area relative to the area of the aortic 
lumen from (A); data are representative of 11 mice per strain pooled from two 
independent experiments. Each point is the mean from multiple sections per 
animal. (C) Representative microscopy images of aortic root sections from 
ApoE~^~ and ApoE/PR3/NE~^~ mice on HFD for 6 weeks and regularly injected 



intravenously with 120 U of DNase or vehicle control (0.9% NaCI). Stained for 
lipid (Oil Red 0, red) and hematoxylin: scale bars, 200 |j,m. (D) Quantitation of 
(C) as in (B). Data are representative of four or five mice analyzed per strain and 
condition. A power analysis revealed 92% power for the difference of means 
between NaCI- and DNase-treated ApoE~^~ mice. Statistics by Student’s t test 
for single comparison and two-way analysis of variance (ANOVA) followed by 
Sidak’s multiple comparison post test for multiple comparisons: **P < 0.01, 
< 0.001, ****P < 0.0001: n.s., not significant. 
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Fig. 3. NETS prime macrophages for cytokine release. (A) Plasma levels of 
IL-lp from wild-type (WT), ApoE~^~, and ApoE/PR3/NE~^~ mice on HFD for 
8 weeks, measured by enzyme-linked immunosorbent assay (ELISA) in n = 17 
mice per strain pooled from three independent experiments. (B) Representa- 
tive confocal immunofluorescence microscopy images of aortic root sections 
from eight ApoE~^~ and five ApoE/PR3/NE~'^~ mice on HFD for 8 weeks and 
stained with the macrophage marker Mac-3 (cyan), IL-lp (magenta), the 
neutrophil marker Ly6G (yellow), and DNA (DAPI, blue) in two independent 
experiments. Dashed line denotes the adventitia (A)-lesion boundary: L, lumen. 
Scale bars, 50 |im. (C) Representative IL-lp mRNA levels in aorta of five ApoE~^~ 
and four ApoE/PR3/NET^~ mice fed HFD for 8 weeks, repeated in two inde- 
pendent experiments and measured by quantitative polymerase chain reaction. 
mRNA levels were normalized to the monocyte/macrophage-specific gene 
\amp2 and expressed relative to levels measured in wild-type mice. A power 
analysis measured 89% power for the difference of means between ApoET^~ and 
ApoE/PR3/NE~^~ mice. (D) Representative confocal immunofluorescence 
microscopy images of aortic root sections from five ApoE~^~ mice on HFD 



for 8 weeks and stained with the macrophage marker Mac-3 (cyan), Ly6G 
(magenta), MPO (yellow), and DNA (DAPI, blue). Scale bars, 50 gm. Right panel 
is a close-up of the boxed area of the left panel: arrowheads point to macro- 
phages. Scale bars, 20 gm. (E) Mature IL-lp (black bars, lefty axis) or IL-6 (gray 
bars, right y axis) protein released by CD-14 blood-derived human monocytes 
untreated or treated with LPS or NETs alone or in the presence of cholesterol 
crystals. Where indicated, cells were treated with oligonucleotide inhibitor (ODN: 
10 yg/ml). (F) Whole-cell lysates or cell culture medium from naive CD-14 blood- 
derived human monocytes were treated with NETs or cholesterol crystals, 
analyzed by SDS-polyacrylamide gel electrophoresis, and immunoblotted for 
IL-lp, caspase-1, and actin. (G) IL-lp (left panel) or IL-6 (right panel) mRNA in 
naive CD-14 blood-derived human monocytes or treated with NETs alone (black 
bars) or in the presence of cholesterol crystals (gray bars). Statistics in (A) 
and (C) by two-tailed, unpaired Student’s t test for single comparison and 
one-way ANOVA, followed by Tukey’s multiple comparison posttest for multiple 
comparisons: **P < 0.01. In (E) and (G), data are representative of three inde- 
pendent experiments across three technical replicates: error bars denote SD. 
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decreased IL-ip release and synergized with 
oligonucleotide inhibition, indicating that both 
protein and DNA moieties are important in NET- 
mediated priming (fig. S5D). The complete abro- 
gation by DNase suggests that the association of 
these moieties is critical. In the absence of cho- 
lesterol, NETS did not induce substantial in- 
fiammasome activation, as reflected by the lack 
of caspase-1 and IDip maturation (Fig. 3F), which 
were observed only upon costimulation with cho- 
lesterol crystals. Stimulation with NETs also up- 
regulated monocytic cytokine transcripts (Fig. 3G). 
These data are consistent with NETs providing 
priming signals in atherosclerosis. 

IL-ip up-regulates the T cell-derived cytokine 
IL-17, which drives the chemokines CXCLl and 
CXCL2 to promote neutrophil recruitment dur- 
ing inflammation (9, 2S). Both groups of mice 
exhibited comparable numbers of T cells (figs. S6 
and S7), but aortas from ApoE/PR3/NE-deficient 
mice on 8-week HFD contained few IL-17^ T cells 
relative to ApoE-deficient controls (Fig. 4, A and 
B). IL-17-producing T cells could not be detected 
in aortas of ApoK^~ animals after 4 weeks on 
HFD (fig. S8, A and B), which suggests that strong 
T cell activation does not precede NET-driven 



inflammation. The blood of NET-deficient mice did 
not exhibit alterations in immune cell populations, 
and circulating IL-17^ T cells were absent in both 
groups (figs. S9 and SIO). The spleen and lymph 
nodes contained a small IL-17^ y5 T cell population 
but no IL-17^ ap T cells (figs. Sll and S12, A and B). 
Furthermore, concentrations of IL-17A, CXCLl, 
CXCL2, and the monocyte chemokine CCL2 were 
also significantly reduced in aortas of i^E/PR3/NE- 
deficient mice (Fig. 4C). Consistently, we counted 
fewer neutrophils in lesions and adventitia of 
ApoE/PR3/NE-deficient mice, by microscopy (Fig. 
4D) and fluorescence-activated cell sorting (FACS) 
(fig. S7B). Although both groups contained a com- 
parable makeup of immune cells (fig. S7A), ApoE/ 
PR3/NE-deficient animals had significantly lower 
total immune cell counts per aorta (fig. S7B), con- 
sistent with reduced inflammation and smaller 
lesions. Adhesion molecule transcripts were similar 
in aortas from both groups, but differences may 
be difficult to detect because of the patchy lesion 
morphology (fig. S12C). 

Neutrophil recruitment was comparable in the 
skin of wild-type and PR3/NE-deficient animals 
treated with Aldara imiquimod (fig. S13, A and B), 
which drives IL-1 exogenously to promote sterile 
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Fig. 4. NETs drive IL-17 and neutrophil chemokine production in atherosclerosis. (A) Representative 
FACS plot of ILT7 intracellular staining in phorbol myristate acetate (PMA)-restimulated ap and y5 T cells 
from digested aortas of ApoE~^~ or ApoE/PR3/NE~^~ mice on HFD for 8 weeks. (B) Representative 
number of cells relative to CD45^ populations and whole aortas from (A) for three animals per group 
repeated in two independent experiments. (C) IL-17A, CXCLl, CXCL2, and CCL2 concentrations from 
whole aorta samples measured by ELISA; n = 5 ApoE~'"~ and n = 4 ApoE/PR3/NE^~ mice. (D) Number of 
total intact Ly6G-stained neutrophils per aortic root section field of view (FOV) measured in immunostained 
micrographs: eight mice per strain in two independent experiments. Statistics by two-tailed, unpaired 
Student’s t test: < 0.05, < 0.001, ****P < 0.0001. 



psoriatic inflammation (26). Hence, the reduc- 
tion in neutrophil recruitment in ApoE/PR3/NE- 
deficient lesions was not due to intrinsic defects in 
neutrophil chemotaxis or extravasation. Therefore, 
NET-mediated priming of macrophages promotes 
a self-amplifying IL-l-IL-17 cascade and uncovers a 
mechanism for neutrophils to regulate T helper 17 
(Th 17) cells that sustains chronic sterile inflam- 
mation (fig. S13C). 

Our data reveal a requirement of NETs as 
priming cues in vivo and show that NETs are 
substantially more potent in priming cytokines 
than in activating the inflammasome. Although 
other endogenous molecules such as oxidized 
LDL can prime in vitro, their importance in vivo 
has not been demonstrated (7, 27). Interestingly, 
antibodies against oxidized LDL in lesions rec- 
ognize oxidized phospholipids that suppress 
inflammation (28). NETosis may prime more 
efficiently than necrosis (29, 30), as it effectively 
exposes highly decondensed and proinflamma- 
tory DNA (31). 

A recent study using Cl-amidine proposed that 
NETs drive a plasmacytoid dendritic cell (pDC)- 
derived interferon-a (IFN-a) autoimmune cascade 
in atherosclerosis via TLR9 ligation (20). However, 
TLR9 deficiency has little effect on atherogenesis 
(32) Moreover, PAD enzymes are expressed in 
many cell types (33) and were dispensable for 
NETosis triggered by cholesterol crystals (fig. SIE). 
Whereas IFN signaling down-regulates IL-ip expres- 
sion, genetic ablation of the IFNa/p receptor yields 
a modest 25% decrease in lesion size (34) because 
IFNs may contribute to pathogenesis via the up- 
regulation of caspases (3, 35, 36). 

NET priming may drive inflammation in other 
NET-associated diseases such as cystic fibrosis 
and rheumatoid arthritis (24, 37). In contrast to 
the broad importance of IL-1 and IL-17, NETs play 
more specialized roles in immune defense (74) 
and NET-deficient individuals are primarily sus- 
ceptible to localized fungal infection (38). Hence, 
blocking NETosis or degrading NETs may help to 
treat inflammatory diseases. 
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HIV-1 VACCINES 

Protective efficacy of adenoviru^rotein 
vaccines against SIV challenges 
in rhesus monkeys 

Dan H. Barouch/’^* Galit Alter, ^ Thomas Broge,^ Caitlyn Linde, ^ 
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Preclinical studies of viral vector-based HIV-1 vaccine candidates have previously shown 
partial protection against neutralization-resistant virus challenges in rhesus monkeys. 

In this study, we evaluated the protective efficacy of adenovirus serotype 26 (Ad26) vector 
priming followed by purified envelope (Env) glycoprotein boosting. Rhesus monkeys 
primed with Ad26 vectors expressing SIVsmE543 Env, Gag, and Pol and boosted with 
ASOlB-adjuvanted SIVmac32H Env gpl40 demonstrated complete protection in 50% of 
vaccinated animals against a series of repeated, heterologous, intrarectal SIVmac251 
challenges that infected all controls. Protective efficacy correlated with the functionality of 
Env-specific antibody responses. Comparable protection was also observed with a similar 
Ad/Env vaccine against repeated, heterologous, intrarectal SHIV-SF162P3 challenges. 
These data demonstrate robust protection by Ad/Env vaccines against acquisition of 
neutralization-resistant virus challenges in rhesus monkeys. 



D espite the urgent need for a safe and ef- 
fective global HIV-1 vaccine, only four 
vaccine concepts have been evaluated for 
protective efficacy in humans during more 
than 30 years (7, 2). In rhesus monkeys, 
vaccine protection has been reported against 
neutralization-sensitive viruses (3), but these 
data failed to predict protective efficacy in hu- 
mans (4), which suggests the importance of 
using neutralization-resistant virus challenges 
for preclinical evaluation of HIV-1 and SIV vac- 
cine candidates. We previously showed that prim- 
ing with adenovirus vectors and boosting with 
poxvirus vectors expressing Env, Gag, and Pol 
resulted in a reduced per-exposure acquisition 
risk after challenges with neutralization-resistant 
SIVmac251, but the majority of these animals 



were infected at the end of the challenge series 
(5, 6). To augment antibody responses, we eval- 
uated the immunogenicity and protective efficacy 
of priming with adenovirus vectors and boost- 
ing with adjuvanted Env gpl40 protein against 
SIVmac251 and SHIV-SF162P3 challenges in rhesus 
monkeys. 

We immunized 32 adult rhesus monkeys 
(Macaca mulatta) that did not express the pro- 
tective major histocompatibility complex class I 
alleles Mamu-A^Ol, Mamu-B^08, or Mamu-BH7 
with adenovirus serotype 26 (Ad26) vectors (7) 
expressing SIVsmE543 Env/Gag/Pol antigens 
(5) followed by either SIVmac32H Env gpl40 
protein (S) (Ad/Env; n = 12) or Ad35 vectors {9) 
expressing SIVsmE543 Env/Gag/Pol antigens 
(Ad Alone; n = 12), and a control group received 



sham vaccines (Sham; n = 8). Animals in the 
Ad/Env group were primed with 3 x 10^^ viral 
particles (vp) Ad26-Env/Gag/Pol vectors (10^° vp 
per vector) by the intramuscular route at weeks 
0 and 24 and were boosted with 0.25 mg Env 
gpl40 with ASOIB Adjuvant System at weeks 52, 
56, and 60. Animals in the Ad Alone group were 
primed with 3 x 10^° vp Ad26-Env/Gag/Pol vec- 
tors at weeks 0 and 24 and were boosted with 

3 X 10^® vp Ad35-Env/Gag/Pol at week 52. One 
control animal died before challenge for rea- 
sons unrelated to the study protocol and was 
excluded from the analysis. 

Binding antibody responses to heterologous 
SIVmac239 Env gpl40 were detected by enzyme- 
linked immunosorbent assay (ELISA) {10) in all 
vaccinated animals after Ad26 priming at weeks 

4 and 28 (Fig. lA). In the Ad/Env group, ELISA 
end-point titers increased from 5.3 logs at week 
28 to 6.4 logs after the SW Env gpl40 boosts at 
week 64 {P < 0.0001) (Fig. lA), which confirmed 
that the Env boost effectively augmented Ad26- 
primed antibody responses. Neutralizing antibody 
(NAb) responses assessed in the TZM-bl cell line 
{11) against tier 1 heterologous SIVmac251_TCLA15 
and homologous SIVsmE660 CP3C-P-A8 viruses 
also increased significantly after SIV Env gpl40 
boosting (fig. SI). NAb responses against tier 2 
viruses were borderline (fig. SI). 

In addition to neutralization, antibodies me- 
diate a wide variety of additional antiviral func- 
tions through their ability to interact with Fc 
receptors, complement, and lectin-like proteins 
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{12, 13). Previous studies showed that antibody- 
dependent cellular phagocytosis (ADCP) {14) and 
antibody-dependent complement deposition (ADCD) 
responses correlated with protective efficacy in 
rhesus monkeys {&). To perform a comprehensive 
analysis of vaccine-elicited antibody responses, 
we evaluated 150 independent antibody Fc pa- 
rameters by high-throughput antibody profiling, 
including multiple assessments of antibody Fc fimc- 
tionality [ADCP, ADCD, antibody-dependent cell- 
mediated cytotoxicity (ADCC), antibody-dependent 
NK cell expression of CDlOTa, interferon-y (IFN-y), 
and the chemokine CCL4], isotypes, glycosyiation, 
complement binding, and Fc receptor binding 
{14-18). Integration of all 3600 data points in a 
principal-component analysis demonstrated that 
the Ad/Env vaccine and the Ad Alone vaccine 
elicited Env-specific antibodies that were pheno- 
typically distinct {P < 0.0001) (Fig. IB). A loadings 
plot (Fig. 1C) showed the distribution of all mea- 
sured Fc features in the same multidimensional 
space, which demonstrated the specific features 
that drove the separation of antibody profiles 
(red arrows). Partial least-squares discriminant 
analysis {19) revealed that the six antibody Fc 
functions described above nearly completely sep- 



arated these groups, with the majority of anti- 
body Fc effector functions clustering with the 
Ad/Env-vaccinated animals (Fig. ID). Univariate 
analyses showed that these antibody Fc func- 
tions were all significantly increased in Ad/Env 
group as compared with the Ad Alone group 
(Fig. IE), and a combined analysis demonstrated 
that the number of antibody Fc functions was 
significantly greater in Ad/Env-vaccinated ani- 
mals as compared with animals vaccinated with 
Ad Alone (Fig. 1, F and G). These data show that 
the protein boost resulted in a more polyfunc- 
tional antibody Fc effector profile. 

Cellular immune responses measured by IFN-y 
enzyme-linked immunospot (ELISPOT) assays in 
response to heterologous SIVmac239 and homol- 
ogous SIVsmE543 Env/Gag/Pol peptide pools 
were also detected in aH animals after vaccination 
(fig. S2). By multiparameter intracellular cytokine 
staining assays, SIV Env gpl40 boosting primarily 
expanded Env-specific IFN-y ^CD4^ T lymphocyte 
responses in the Ad/Env group, whereas Ad35- 
Env/Gag/Pol boosting substantially expanded 
IFN-y^CD8^ T lymphocyte responses in the Ad 
Alone group (fig. S2). Both CD28^CD95^ central 
and transitional memory and CD28^CD95" effector 



memory CD4^ and CD8^ T lymphocyte responses 
{20, 21) were elicited by both vaccines (fig. S3). 

To evaluate the protective efficacy of these vac- 
cine regimens, all animals were challenged with 
six repeated, intrarectal inoculations with 500 tis- 
sue culture infectious doses (TCID50) of the het- 
erologous, neutralization-resistant virus SIVmac251 
(5, 22, 23) beginning week 96 (Fig. 2, A and B). All 
control animals were infected by this challenge 
protocol. The Ad Alone vaccine regimen resulted 
in a 75% reduction in the per-exposure acquisition 
risk as compared with controls [1 - (hazard ratio); 
P = 0.039, Cox proportional hazard model], which 
is consistent with our prior studies (5). In con- 
trast, the Ad/Env vaccine regimen afforded a 90% 
reduction in the per-exposure acquisition risk as 
compared with controls {P = 0.001). Moreover, 
50% (6 of 12) of animals in this group also ap- 
peared uninfected at the end of this challenge 
protocol {P = 0.012 compared with controls, chi- 
square test; P = 0.044, Fisher’s exact test) (Fig. 2B). 
Protection in the Ad/Env group was greater than 
that in the Ad Alone group {P = 0.042, chi-square 
test; P = 0.097, Fisher’s exact test). Binding anti- 
body titers [P < 0.0001; correlation coefficient 
{R) = 0.75] and antibody Fc polyfunctionality 
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Fig. 1. Humoral immune responses elicited by the Ad26/Env SIV vaccine. (A) SIVmac239 Env-specific ELISA titers at weeks 0, 4, 

28, 64, and 96. Mean log end-point ELISA titers are shown. (B to G) Multidimensional analysis integrating 150 antibody Fc parameters 
per animal demonstrates differences in antibody Fc profiles. (B) Principal component analysis (PCA) of antibody Fc profiles from 
animals immunized with Ad/Env (red) or Ad Alone (blue) vaccines. (C) Loadings plot mirrors the multidimensional space in the PCA 
but shows the distribution of all measured Fc features, which demonstrates the features that drove the separation of antibody profiles 
(red arrows). (D) Partial least-squares discriminant analysis (PLSDA) of antibody profiles from Ad/Env and Ad Alone vaccinees. (E) Univariate analyses of 
antibody functions identified in (D). A composite (F) dot plot and (G) pie chart show the overall functionality of antibody responses elicited by the Ad/Env 
and the Ad Alone vaccines. Error bars reflect SEM. P values reflect Mann-Whitney tests. *P < 0.01, **P < 0.001, ***P < 0.0001. 
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(P = 0.004; R = 0.56) best correlated with pro- 
tection against acquisition of infection, as mea- 
sured by the number of challenges required for 
infection (Fig. 2C). Individual antibody functions 
(ADCP, ADCC, CD107, CCL4) also correlated v^th 
protection (P < 0.05). 

In the Ad/Env group, plasma viral loads were 
persistently negative in the six protected mon- 
keys for 400 days after challenge (Fig. 2D). Of 
the six infected animals in this group, four ani- 
mals developed measureable chronic set-point 
viremia, whereas two monkeys exhibited tran- 
sient acute viremia and subsequently became elite 
controllers with undetectable plasma viral loads 
(Fig. 2D and fig. S4). In the Ad Alone group, plas- 
ma viral loads were persistently negative in 2 of 
12 monkeys, and chronic viremia developed in 
10 of 12 animals. In contrast, all sham controls 
developed high levels of chronic viremia with a 
median set-point viral load from days 100 to 
400 after infection of 6.03 log copies/ml, which 
was at least 1.65 logs as high as the median set- 
point viral load in the animals in the Ad/Env 
group that became infected (P = 0.035) (fig. S5). 



We previously reported that progressive SIV 
infection correlated with a marked expansion of 
the enteric virome in rhesus monkeys, particu- 
larly for picomavirus reads (24-27). Metagenom- 
ics sequencing of stool samples in the present 
study demonstrated that the enteric virome ex- 
panded by week 28 but not by week 10 in the 
sham controls (P = 0.015) (fig. S6). Both the Ad/Env 
and the Ad Alone vaccines reduced the expan- 
sion of total enteric reads including enteric picor- 
naviruses (P = 0.002 and P = 0.042, respectively) 
(fig. S6). The Ad/Env vaccine also reduced AIDS- 
related mortality as compared v^th the sham 
controls (P = 0.020) (fig. S7). 

We next investigated whether the vaccinated 
animals that exhibited persistently negative plas- 
ma viral loads were completely protected by com- 
prehensive tissue analyses, adoptive transfer 
studies, and immunologic assays. We performed 
necropsies on the six protected animals in the 
Ad/Env group, the two protected animals in the 
Ad Aone group, and one of the elite controllers in 
the Ad/Env group at -400 days after challenge 
(Fig. 2D). AU of these animals had negative plasma 



viral loads at the time of necropsy. We assessed 
36 gastrointestinal, lymphoid, and reproductive 
tract tissues per animal (28 tissues in males) by 
ultrasensitive nested quantitative polymerase chain 
reaction or quantitative reverse transcription poly- 
merase chain reaction assays for SIV DNA and 
SIV RNA as previously described (28). Viral DNA 
and RNA were readily detectable in all tissues in 
the elite controller (Fig. 3A, red circles) but not 
in the eight protected animals (Fig. 3A, black cir- 
cles), except for one viral signal in a single animal, 
which is within the range of expected background 
false-positive signals in similar analyses of naive 
animals (28). 

We next performed adoptive transfer studies 
and infused 60 million peripheral blood and lymph 
node mononuclear cells by the intravenous route 
from the eight apparently protected animals and 
the two elite controllers into naive rhesus mon- 
key hosts. Cells from the elite controllers readily 
transferred infection and resulted in plasma 
viral loads of 6.87 to 7.12 log copies/ml in naive 
recipients by day 14 after adoptive transfer (Fig. 
3B, red lines). In contrast, cells from the 
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Fig. 2. Protective efficacy 
of the Ad26/Env SIV 
vaccine against repeated, 
intrarectal SIVmac251 
challenges. (A) Number of 
challenges required for 
acquisition of infection in 
each vaccine group. (B) 
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exact tests. (C) Correlation 

of log ELISA titers and antibody Fc polyfunctionality at week 64 with the number of challenges required to establish infection. The plotted data reflect only 
vaccinated animals and do not include the sham controls. Values plotted as >6 challenges reflect animals that remained uninfected. Overlapping data 
points are shown as a single symbol. P values reflect Spearman rank correlation tests. (D) Plasma SIV RNA copies/ml over time in infected and protected 
animals in each vaccine group. Red asterisks indicate elite controllers. indicates mortality. 
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protected animals failed to transfer infection 
(Fig. 3B, black lines). 

Furthermore, the protected animals exhibited 
no increase in Env/Gag/Pol-specific cellular im- 
mune responses after challenge and also no re- 
sponses to Vif, which was not included in the 
vaccine, whereas vaccinated animals that became 
infected developed massive anamnestic Env/Gag/ 
Pol-specific cellular immune responses and pri- 
mary Vif-specific responses (fig. S8). The protected 
animals also exhibited no anamnestic Env-specific 
ELISA antibody responses after challenge. Taken 
together (Fig. 3 and fig. S8), these data strongly 
suggest that the Ad/Env vaccine afforded com- 
plete sterilizing protection in 50% of animals against 
the SIVmac251 challenge protocol and that the 
mechanism of protection involved primaiy block- 
ing of acquisition of infection. 

To confirm these findings with analogous vac- 
cines expressing HIV-1 immunogens, we used a 



group of 20 rhesus monkeys that had been im- 
munized previously at weeks 0 and 40 with 
Ad26 and Ad5HVR48 vectors expressing mo- 
saic, consensus, or natural clade C HIV-1 Env/ 
Gag/Pol immunogens (29). Two years after Ad 
priming, these animals were boosted six times 
with 0.25 mg HIV-1 clade C C97ZA012 Env gpl40 
(10, 30) with the ASOIB Adjuvant System at weeks 
156, 160, 164, 176, 180, and 184 (Ad/Env; n = 20). 
A second group of animals received only 0.25 mg 
Env gpl40 with ASOIB at the same six time points 
(Env Alone; n = 8), and a third control group re- 
ceived sham vaccines (Sham; n = 12). The Ad/Env 
vaccine elicited greater antibody responses than 
did the Env Alone vaccine by ELISA (fig. S9); fimc- 
tional nonneutralizing antibody assays (6, 14, IS) 
(fig. S9); tier 1 NAb TZM-bl assays (fig. SIO); tier 2 
NAb A3R5 assays (fig. Sll); and linear peptide 
microarray assays (31, 32), including variable re- 
gion 2-specific responses (33, 34) (fig. S12). Pro- 



tective efficacy was assessed by six intrarectal 
challenges with 500 TCID50 of the heterologous, 
neutralization-resistant virus SHIV-SF162P3 (6) 
beginning week 196 (Fig. 4, A and B). Although 
the Env Alone vaccine afforded only minimal 
protection, 40% (8 of 20) of Ad/Env-vaccinated 
animals were completely protected against this 
challenge series (P = 0.006 compared with con- 
trols, chi-square test; P = 0.014, Fisher’s exact test) 
(Fig. 4B and figs. S13 to S15). Binding antibody 
titers (P = 0.008) and ADCP responses (P = 0.001) 
correlated with protection against acquisition 
of infection. 

Our data demonstrate the protective efficacy 
of Ad/Env vaccine regimens against SIVmac251 
and SHIV-SF162P3 challenges in rhesus monkeys 
and suggest that the Env protein boost improved 
protective efficacy by enhancing the functional- 
ity of vaccine-elicited, Env-specific antibody re- 
sponses. In contrast, DNA/Ad5 vaccines afforded 
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Fig. 3. Tissue analyses and adoptive transfer studies in SIVmac251 challenged animals. 

(A) Ultrasensitive SIV DNA and SIV RNA assays in multiple tissues from one elite controller (red) 
and the eight apparently completely protected monkeys (black) after necropsy. SIV copies/10^ cell 
equivalents are shown. (B) Plasma SIV RNA copies/ml in previously naive recipient monkeys after 
adoptive transfer of 60 million peripheral blood and lymph node mononuclear cells from the two 
elite controllers (red) or the eight apparently completely protected monkeys (black). Only 1 week 
of follow-up was available for one of the protected animals. 
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no protection against SIVniac251 challenges (3), 
which reflects the ability of DNA/Ad5 vaccines 
to block only neutralization-sensitive virus clones 
(35). Alphavirus vector priming and Env pro- 
tein boosting afforded partial protection against 
the neutralization-sensitive virus SHIV-SF162P4 
but was not evaluated against neutralization- 
resistant viruses (36). Rhesus cytomegalovirus 
(CMV) vectors failed to block acquisition of in- 
fection but afforded post-infection virologic con- 
trol and eventual viral clearance in about half of 
the animals after SIVmac239 challenges (28, 37). 

The protective efficacy of Ad/Env vaccines 
against acquisition of neutralization-resistant 
virus challenges in rhesus monkeys in the present 
study has important implications for HIV-1 vac- 
cine development and suggests the potential of 
Env protein boosting after Ad vector priming. 
Nevertheless, important differences exist be- 
tween SIV/SHIV infection in rhesus monkeys 
and HIV-1 infection in humans. Clinical efficacy 
studies are therefore required to determine the 
protective efficacy of these HIV-1 vaccine candi- 
dates in humans. 
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CIRCADIAN RHYTHMS 

A protein fold switch joins the 
circadian oscillator to clock output 
in cyanobacteria 

Yong-Gang Chang/ Susan E. Cohen, ^ Connie Phong, ^ William K. Myers,^ 

Yong-Ick Kim,^ Roger Tseng,^’^ Jenny Lin,^ Li Zhang,^ Joseph S. Boyd,^ Yvonne Lee,® 
Shannon Kang,® David Lee,’^ Sheng Li,’^ R. David Britt,^ Michael J. Rust,^ 

Susan S. Golden,^’® Andy LiWang^’^’^’®’®* 

Organisms are adapted to the relentless cycles of day and night, because they evolved 
timekeeping systems called circadian clocks, which regulate biological activities with 
~24-hour rhythms. The clock of cyanobacteria is driven by a three-protein oscillator 
composed of KalA, KaiB, and KaiC, which together generate a circadian rhythm of KaiC 
phosphorylation. We show that KaiB flips between two distinct three-dimensional folds, 
and its rare transition to an active state provides a time delay that is required to match the 
timing of the oscillator to that of Earth’s rotation. Once KaiB switches folds, it binds 
phosphorylated KaiC and captures KaiA, which initiates a phase transition of the circadian 
cycle, and it regulates components of the clock-output pathway, which provides the link 
that joins the timekeeping and signaling functions of the oscillator. 



E ndogenous circadian 24-hour) rhythms 
are found in diverse organisms, arising 
as an adaptation to Earth’s persistent 
cycles of night and day (1). To uncover the 
molecular mechanism of a circadian clock, 
we chose the cyanobacterial system because its 
oscillator can be reconstituted in vitro (2). The 
oscillator is composed of only three proteins KaiA, 
KaiB, and KaiC (3), which together generate a 
circadian rhythm of KaiC phosphorylation at 
residues serine 431 (S431) and threonine 432 
(T432) in the CII domain (4, 5). KaiA promotes 
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KaiC (auto)phosphorylation during the subjec- 
tive day (4, 6), whereas KaiB provides negative 
feedback to inhibit KaiA (7, 8) and promotes 
KaiC (auto)dephosphorylation during the sub- 
jective night. KaiB is also involved in regulating 
two antagonistic clock-output proteins— SasA 
(9) and CikA (10), which reciprocally control the 
master regulator of transcription, RpaA (11). 

To determine the structure of KaiB in its 
KaiC-bound state, we used a monomeric var- 
iant of the KaiB-binding domain of KaiC, Cl*, 
and a dimeric KaiB variant (12), KaiB*, with en- 
hanced KaiC binding. Dimeric forms of free KaiB 
retain the same tertiary structure in crystals as 
tetrameric forms (13). Free KaiB has been shown 
by x-ray crystallography (14) to adopt a fold 
found in no other protein (75), despite clear se- 
quence similarity with the thioredoxin-like fold 
at the N terminus of SasA, N-SasA (9). For struc- 
tural studies, we used proteins from Thermo- 
synechococcus elongatus (denoted by ^^), because 
they are more stable than those from Synecho- 
coccus elongatus (16). For functional studies, we 
used proteins from S. elongatus (denoted by ®^), 
the standard model for investigating in vivo 



324 17 JULY 2015 • VOL 349 ISSUE 6245 



sciencemag.org SCIENCE 



RESEARCH \ REPORTS 



circadian rhythms (17). Analytical ultracentrifiiga- 
tion experiments indicated that KaiB^* binds to 
as a monomer with a stoichiometric ratio of 
1:1 (fig. SIA). Secondary chemical shifts of back- 
bone resonances (18) of KaiB^^* in a complex 
with (fig. SI) revealed a thioredoxin-like 
secondary structure (papappa) (19), rather than 
the secondary structure of free KaiB (PaPPaaP) 
found in protein crystals (Fig. lA). Hereafter, 
we refer to the PappaaP form of KaiB as the 
ground state (gsKaiB), and the PaPaPPa state as 
fold-switched (fsKaiB). Fewer than 10 proteins 
are known to switch reversibly between distinct 
folds under native conditions, and they are col- 
lectively known as metamorphic proteins (20). 
KaiB is the only metamorphic protein known to 
function in biological clocks. 

Along a p strand, side chains typically alter- 
nate titi"" In the P4 strand of gsKaiB, the side 
chain pattern is ti-t? where the dash is G89; in 
fsKaiB, G89 lies in the a3 helix. We reasoned 
that a G89A substitution would destabilize P4 in 
gsKaiB but not a3 in fsKaiB (Fig. IB). A D91R sub- 
stitution should also destabilize gsKaiB. Nuclear 
magnetic resonance secondary chemical-shift 
analysis revealed that, unlike KaiB^^*, the two 
single-point mutants had populations of both 
gsKaiB and fsKaiB states, but the double mu- 
tant was >98% in the fsKaiB state (figs. S2 to 
S6). A structural model of G89A,D91R-KaiB^^* 
determined by CS-Rosetta (21), using chemical 
shifts and backbone amide nuclear 

Overhauser effects as restraints, confirmed that 
G89A,D91R-KaiB^^* adopted a thioredoxin-like 
fold (fig. S7), similar to that of N-SasA (22). 

The corresponding KaiB variants in S. elongatus, 
G88A-KaiB"^ D90R-KaiB"^ and G88A,D90R- 
KaiB®^ also promoted the fsKaiB state relative 
to wild-type (WT) KaiB®^ (figs. S8 and S9), al- 
though to a lesser extent than in the correspond- 
ing T. elongatus variants. G88A,D90R-KaiB®^ 



formed a complex with with near-complete 
binding within 5 min (fig. SIO). In contrast, WT 
KaiB®® bound Cl®®* marginally, even after 24 hours 
(fig. Sll). In vitro oscillation assays showed that 
the KaiB®® variants disrupted KaiC®® phospho- 
rylation rhythms (Fig. 2A and fig. S12). Amounts 
of KaiC®® phosphorylation were larger in the 
presence of D90R-KaiB®® or G88A,D90R-KaiB®® 
than they were with G88A-KaiB®®. G88A-KaiB®® 
formed a complex with KaiA®® (Fig. 2B), whereas 
the two KaiB®® mutants containing D90R did 
not (fig. S13), which indicated that although 
the D90R mutation promotes the fsKaiB state, 
it also disrupts binding. Larger amounts of un- 
sequestered KaiA would be expected to lead to 
larger amounts of KaiC phosphorylation, which 
would account for the observed differences in 
the KaiC phosphorylation profiles when the D90R 
mutants were used. In vivo bioluminescence 
rhythms from cyanobacterial luciferase report- 
er strains harboring kaiB^^ variants (Fig. 2C) 
were also disrupted, with phenotypes that close- 
ly matched the in vitro phosphorylation patterns 
(Fig. 2A). A chromosomal copy of kaiB7 in addi- 
tion to the mutant kaiB variants did not restore 
bioluminescence rhythms in vivo, which indi- 
cated a dominant-negative effect of the fold- 
switch mutations (Fig. 2D and fig. S14). KaiB 
fold switch-stabilizing mutants are less abun- 
dant in vivo than WT KaiB (fig. S15), so at equi- 
librium in vivo fsKaiB is probably rare. 

Although KaiB®® variants disrupted rhythms 
in vitro and in vivo, each of them restored the 
cell-length phenotype in kaiB~ strains (Fig. 2, E 
and F) in which the SasA-RpaA output path- 
way was hyperstimulated and cell division was 
inhibited (23). These functional variants indi- 
cate that fsKaiB regulates the clock-output 
enzymes SasA (9) and CikA (10, 23). SasA is 
activated when it binds KaiC (24), and SasA 
and KaiB compete for the Cl domain of KaiC 



(25). Preincubation with the D90R-KaiB®® and 
G88A,D90R-KaiB®® variants inhibited the abil- 
ity of S431E-KaiC®® (a KaiC variant that mimics 
phosphorylation at the S431 residue) to trigger 
output signaling through SasA®® (Fig. 2G and 
fig. S16). Also, mixtures of KaiB and KaiC ac- 
tivate the phosphatase activity of CikA that de- 
phosphorylates RpaA (26). Relative to WT KaiB®®, 
fsKaiB®® variants enhanced CikA®® phosphatase 
activity by about threefold in vitro (Fig. 2H and 
fig. S17) and suppressed RpaA®® phosphorylation 
in vivo (fig. S18). We propose that fsKaiB forms 
a complex with KaiC that both activates signal- 
ing through CikA and inhibits signaling through 
SasA by outcompeting SasA for binding to KaiC. 
It is likely that fsKaiB interacts with the CikA 
pseudo-receiver domain (PsR-CikA), because 
adding PsR-CikA to the in vitro oscillator short- 
ened the period and reduced the amplitude 
(Fig. 21 and fig. S19). Overexpression of just the 
PsR-CikA domain in cyanobacteria similarly 
shortened the period of bioluminescence rhythms 
(27). Interaction with PsR-CikA was detected 
for G88A,D90R-KaiB®®, but not KaiB®® (fig. S20). 
Addition of the PsR domain of KaiA did not af- 
fect phosphorylation rhythms (Fig. 2J), even 
though it has the same tertiary structure as 
PsR-CikA (6, 28). 

fsKaiB/gsKaiB equilibrium constants, K = 
k^Jk^_, were estimated by fitting the kinetics 
of binding of KaiB^® variants to CI^®* (fig. S21). 
Equilibrium constants were larger for G89A- 
KaiB^® (K = 0.13 + 0.02), D91R-KaiB^® (1.2 + 0.1), 
and G89A,D91R-KaiB^® (6.7 ± 0.4), relative to 
KaiB^® (0.08 + 0.01). For KaiB^® variants bind- 
ing S431E-KaiC^®, a distinct fast-binding phase 
was followed by slow binding (Fig. 3A). These 
multiphase binding kinetics were reproduced 
in a gsKaiB o fsKaiB fold-switching model (Fig. 
3B), in which KaiB was initially at equilibrium 
between the two folds. The pool of fsKaiB bound 
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Fig. 1. KaiB switches its fold to bind KaiC. (A) Plots of chemical shift-based 
secondary structures of KaiB*®* KaiB*®* Cl*®*, and G89A,D91R-KaiB*®* determined 
by TALOS-t- (18). Unassigned proline and nonproline residues are indicated by small 
red and blue dots along the horizontal axis at y = 0. The secondary structures of 
KaiB*® and G89A,D91R-KaiB*®* are shown for comparison. KaiB*®* residues Q52 to 
E56 in the KaiB*®* to Gl*®* complex were not assignable, probably owing to exchange 
broadening. Vertical dashed lines are visual guides separating the N-terminal and 
G-terminal halves of KaiB. (B) Structural comparisons of KaiB*®, G89A,D91R-KaiB*®* 
and N-SasA®®. Residues K58, G89, and D91 are highlighted for their roles in fold 
switching. 
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rapidly upon adding S431E-KaiC (Fig. 3C), fol- 
lowed by a slow gsKaiB ^ fsKaiB population 
shift. As shown by our computational model 
(Fig. 3B and fig. S22), the slow formation of the 
KaiB-KaiC complex contributes to the delay 
that allows a population of KaiC proteins to be- 
come highly phosphorylated under continued 
stimulation by KaiA. Increasing the rate of KaiB- 
KaiC binding in our model causes the in silico 
phosphorylation rhythm to fail (Fig. 3D), similar 
to that observed in vitro. Comparing the kinetics 
of binding to Cl (fig. S21) and full-length KaiC 
(Fig. 3A) by KaiB mutants shows that full-length 
KaiC contributes to the slow phase as well, and 



this is likely due to the Cl adenosine triphospha- 
tase (29) exposing the KaiB-binding site upon its 
activation by CI-CII ring stacking (12). 

N-SasA, which also adopts a thioredoxin-like 
fold (22), competes with KaiB for binding KaiC 
(25) (fig. S23). Intermolecular distances for the 
complexes CI^^*-G89A,D91R-KaiB^^* and Cf 
N-SasA^^ (figs. S24 and S25) were measured by 
using the pulsed electron paramagnetic reso- 
nance method of double electron-electron reso- 
nance (DFFR), in combination with mutagenesis 
studies (figs. S26 to S29). The higher-quality DEER 
data for the CI^^*-N-SasA^ complex allowed struc- 
tural modeling (Fig. 4A and figs. S30 and S31). 



In this model, the a2 helix of N-SasA binds to the 
B-loop region of the Cl domain (residues A109to 
D123), with additional interactions involving the 
Cl a-helix that follows. Hydrogen-deuterium ex- 
change mass spectrometry (HDX-MS) data on 
both complexes (Fig. 4, A and B, and figs. S32 
to S35) also suggest that this region of KaiC is a 
common binding site for KaiB and SasA. This 
model is consistent with a report that truncation 
of the B loop abolished binding (25). B-loop trun- 
cation also restored the cell-length phenotype of 
a strain that lacked kaiB (fig. S36). Furthermore, 
a F121A mutation in the B loop of full-length KaiC 
(F121A-S431E-KaiC®^) abolished KaiC binding 
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Fig. 2. KaiB fold switching regulates oscillator function and clock output. 

(A) In vitro KaiC phosphorylation assays using KaiC^®, KaiA^®, and KaiB^®, 
G88A-KaiB^^ D90R-KaiB^^ or G88A,D90R-KaiB^®. (B) Gel-filtration profiles 
of G88A-KaiB^^ KaiA^®, and G88A-KaiB^® -r KaiA^®. Peaks (a) to (c) were ana- 
lyzed by SDS-PAGE (fig. S13). (C) Bioluminescence from strains that carry a 
PkaiB luc reporter for circadian rhythmicity. Gells harbored kaiB^^, G88A-kaiB^^, 
D90R-kaiB^^, or G88A,D90R-kaiB^^, or cells with kaiB^^ deletion. Time in LL, time 
under low-light conditions. (D) Bioluminescence from strains that carry a PkaiB 
luc reporter expressing G88A-kaiB^^, D90R-kaiB^^, G88A,D90R-kaiB^^ 
or empty vector, in addition to chromosomal kalB^^. (E) Representative mi- 
crographs of cells expressing ka/B^®, lacking ka/8^®, or harboring G88A,D90R- 
kalB^^. Gellular autofluorescence in red. Scale bars, 2.5 gm. (F) Histograms 
showing cell-length distributions of strains expressing kalB^^, AkaiB^^, G88A- 
kalB^^, D90R-kaiB^^, or G88A,D90R-kaiB^^ as the only copy of kaiB. (G) SasA^® 



kinase activities in the presence of S431E-KaiG^® and KaiB^®, G88A-KaiB^®, 
D90R-KaiB^®, or G88A,D90R-KaiB^®.The mixtures were incubated for 2 hours 
before SasA^®, RpaA^® and [y-^^PJATP (labeled adenosine triphosphate) were 
added. Relative kinase activities compare the mean steady-state amount of 
^^P-labeled RpaA^® to that of a reaction of S431E-KaiG^® alone (n = 4, error bars 
denote SEM). One-way analysis of variance (ANOVA) gives P < 0.001, and **** 
denotes Bonferroni-corrected values (P < 0.001) for pairwise comparisons 
against kinase activity with KaiB^® (a = 0.05). (H) GikA phosphatase activity 
toward phosphorylated RpaA in the presence of KaiG^® and KaiB®®, G88A- 
KaiB®®, D90R-KaiB®®, or G88A,D90R-KaiB®® (n = 4-5, error bars denote SEM). 
KaiG®® alone or KaiB®® alone did not activate GikA phosphatase activity (fig. 
S17). (I) In vitro KaiG®® phosphorylation assays as a function of concentration 
of PsR-GikA®®. (J) Same as (I) except for using PsR-KaiA®® instead of PsR-GikA®®. 
The black curves in (I) and (J) are identical. 
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to full-length SasA"^ KaiB"^ and G88A,D90R- 
KaiB"^ (figs. S37 to S40). HDX-MS data (Fig. 4, A 
and B) indicated that N-SasA and KaiB induce 
long-range perturbations in isolated Cl domains 
that, in the context of full-length KaiC, may affect 



intersubunit interactions. Another HDX-MS- 
derived model (30) indicated that KaiB most 
likely binds to CII, on the basis of a calculation 
minimizing the docking energy between KaiC 
and gsKaiB, not fsKaiB. 



Naturally occurring variations at residue po- 
sitions 58, 89, and 91 of KaiB (fig. S41A) suggest 
that the ancestral KaiB protein had the fsKaiB 
thioredoxin-like fold, and the gsKaiB fold evolved 
later with the circadian clock. In support of 




Fig. 3. KaiB fold-switching 
regulates slow formation of 
the KaiB-KaiC complex. (A) 

Fluorescence anisotropies of 
6-iodoacetamidofluorescein 
(6-IAF)-labeled KaiB*®, 

G89A-KaiB*®, D91R-KaiB*®, and 
G89A,D91R-KaiB*® in the presence 
of S431E-KaiC*®. KaiB samples 
were incubated for 1 hour (circles) 
before addition (arrow) of 
S431E-KaiC‘®. A54C mutation 
was introduced to all KaiB for 
fluorescence labeling. (B) Scheme 
for modeling. (C) Forward fold- 
switching rate constants, /cb+ 

(maroon), and burst-phase binding 
to S431E-KaiC*® (tan). Burst-phase 
binding— defined as the percent- 
age of KaiB*®-S431E-KaiC*® com- 
plexes formed at t = 0.1 hours in the model relative to steady-state binding at 
f = 24 hours— were derived from fitting data after adding S431E-KaiC^® in (A) 
to the model shown in (B). Burst-phase error bars show the standard de- 
viation from model calculations by bootstrap resampling the raw data (n = 20). 
ks+ values used in these fits were predetermined from analysis of the kinetics 
of binding of KaiB^® variants to the isolated Cl*®* domain (fig. S21), a condi- 
tion where we assumed the rate-limiting step in complex formation is due 



only to KaiB fold switching. Error bars for ks+ were estimated by bootstrap 
resampling the original data set 500 times. (D) Mathematical modeling of 
KaiC phosphorylation period (black) and probability of stable oscillation 
(purple), as a function of kB+.The black bars indicate the standard deviation 
in the model period from 100 oscillator calculations at each value of ks+, 
with the other parameters randomly varied as described in supplementary 
materials. 




Fig. 4. KaiB and SasA bind to similar sites on Cl. (A) An electron paramagnetic resonance)-restrained model of the CI*®*-N-SasA*® complex. The 
HADDOCK model of the complex with the best score is superimposed on the crystal structure of KaiC*® (PDB ID: 4o0m). (B) Qualitative structural model of 
the interaction of Cl*®* and fsKaiB (G89A,D91R-KaiB*®*) based on HDX-MS data and mutagenesis. Dark blue and cyan spheres represent Cl residues whose 
mutations strongly or moderately weaken binding, respectively. Dark blue and cyan ribbons represent protection against H/D exchange upon complex 
formation that are >1.5 and 0.5 to 1.5 standard deviations above the average, respectively, as determined by HDX-MS (figs. S32 to S35). 
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Fig. 5. Model of KaiB fold 
switching as linchpin for the 
cyanobacterial clock. Excur- 
sion of KaiB to the rare fold- 
switch state causes fsKaiB to 
displace SasA for binding to 
KaiC. KaiC-stabilized fsKaiB 
captures KaiA, initiating the 
dephosphorylation phase of 
the cycle. These aspects con- 
trol oscillator period. CikA and 
KaiA compete for binding to 
fsKaiB, which further links 
oscillator function related to 
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gsKaiBiJ^ . Q fsKaiB 



Oscillator 



KaiA and output activity via 
CikA-mediated dephosphoryl- 
ation of RpaA.The competitive 
interactions of fsKaiB with 
SasA, and KaiA with CikA, implicate 



Output 
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rhythmic gene expression 
cell division 

“output components” CikA and SasA as parts of an extended oscillator. 



this notion, a homolog of KaiB from Legionella 
pneumophila, with no known circadian rhythms, 
has an alanyl residue at position 89 and crys- 
tallizes in the fsKaiB fold {31) (fig. S41B). Rare 
excursions of KaiB between two distinct folds are 
essential for a robust circadian period and re- 
ciprocally regulate mutually antagonistic clock 
output-signaling pathways (Fig. 5). 
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foxl3 is a germ cell-intrinsic factor 
involved in sperm-egg fate decision 
inmedaka 

Toshiya Nishimura/’^ Tetsuya Sato,^’^ Yasuhiro Yamamoto,^ Ikuko Watakabe/ 

Yasuyiiki Ohkawa,^ Mikita Suyama,^’^ Satoru Kobayashi,^’®* Minoru Tanaka^’^f 

Sex determination is an essential step in the commitment of a germ cell to a sperm or 
egg. However, the intrinsic factors that determine the sexual fate of vertebrate germ 
cells are unknown. Here, we show that fox/3, which is expressed in germ cells but not 
somatic cells in the gonad, is involved in sperm-egg fate decision in medaka fish. Adult XX 
medaka with disrupted fox/3 developed functional sperm in the expanded germinal 
epithelium of a histologically functional ovary. In chimeric medaka, mutant germ cells 
initiated spermatogenesis in female wild-type gonad. These results indicate that a germ 
cell-intrinsic cue for the sperm-egg fate decision is present in medaka and that 
spermatogenesis can proceed in a female gonadal environment. 
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I n vertebrates, gonadal somatic cells instruct 
germ cells to adopt their sexual fates. In 
medaka {Oryzias latipes), the expression of 
DMY/dmrtlbYin the supporting somatic cells 
is critical for the fate decision of germ cells to 
enter spermatogenesis {1, 2). However, the molec- 
ular mechanism underlying the sexual fate deci- 
sion in germ cells remains unknown. In this study, 
we show that foxl3 serves as a germ cell-intrinsic 
cue for sperm-egg fate decision. 

foxl3 is an ancient duplicated copy of foxl2 
that is expressed in gonads of teleost fish (3-5). 
Although foxl2 is known to be essential for ovarian 
development and maintenance {6-9), the function 
of foxl3 is not ]mown.foxl3 transcripts and FOXL3 
protein (foxl3/¥OXL3) were first detectable in germ 



cells of both XX and XY embryos at stage 35, the 
time of onset of gonadal sex differentiation (figs. SI, 
A to D, and S2, A to D). After this stage in XX 
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Fig. 1. Dimorphic expression of FOXL3 during gonadal development. (A) Schematic representation 
of type I and type II division and expression of FOXL3 protein (red) in germ cells (green). “Active” and 
“quiescent” indicate mitotic states. (B) Ventral view of the gonad, observed with confocal laser 
microscopy. FOXL3 expression (red) in XX gonad at 7 days after hatching. FOXL3 signals are detected in 
a subset of type I (1) and all mitotic type II (2, 2-cell cyst; 3, 8-cell cyst) germ cells, but not in meiotic 
germ cells (4, zygotene: 5, pachytene) or diplotene oocytes (6). (C) The number of FOXL3-positive cells 
during gonadal development in XX and XY gonads. Values are expressed as means ± SEM. The number 
of samples examined at each stage is shown in fig. S2. st., stage: dph, days post-hatching. Scale bars, 
10 |im. 



gonads, germ cells can be categorized into two 
types according to their division (Fig. lA) (10, IT). 
Type I is stem-type self-renewal division, in which 
germ cells divide completely to generate two iso- 
lated daughter cells surrounded by supporting 
cells. Type I includes both mitotically active and 
quiescent germ cells (12). Type II is cystic division, 
in which germ cells divide synchronously vrith 
intercellular bridges. Type II is gametogenesis- 
committed division, followed by meiosis and 
oogenesis. In XY gonads, the transition from 
type I to type II is suppressed until 1 month after 
hatching. 

foxlS/EOXLS appeared in a subset of mitoti- 
cally active type I germ cells, but not in quiescent 
type I germ cells, at stage 35 onward in both XX 
and XY gonads (Fig. 1, A and B, and fig. S3). In 
XX gonads, the signals continued to be detected 
in type II germ cells but disappeared in meiotic 
germ cells and oocytes (Fig. IB and fig. SI, E and 
G). Whereas foxlS/EOXlS were detected through- 
out gonadal development in XX fish, they disap- 
peared in all germ cells of XY fish by 10 days after 
hatching (Fig. 1C and figs. SI and S2). 

To examine the function of foxlS in the gonad- 
al sex differentiation, we generated transcription 
activator-like effector nuclease (TALEN)-induced 
mutants of foxlS. We designed two types of 
TALENs targeting foxlS and obtained two dif- 
ferent mutant alleles (NA17 and FHA8), which 
contain frameshifts causing premature trunca- 
tion upstream of and at the forkhead domain of 
FOXL3, respectively (fig. S4, A to D). Antiserum 
recognizing the C terminus of FOXL3 protein 
detected the protein in both heterozygous (+/-) 
mutants but failed to detect it in both homozy- 
gous (-/-) mutants (fig. S4, E to H), indicating 
that foxlS was successfully disrupted by TALEN. 
One week after hatching, a stage at which oocytes 
are formed in vrild-type XX gonads (Fig. IB), the 
foxlS~^~ XX gonads had no oocytes but were 
instead filled vrith cystic and meiotic germ cells 
(Fig. 2, A to C), suggesting that commitment to 
type II division and meiosis is not affected by loss 




Fig. 2. fox/3 is responsible for suppressing initiation of spermatogenesis in medaka germ cells. (A to D) Ventral view of a foxl3~^~ XX gonad at 7 days 
after hatching. The white boxes are magnified in (B) to (D). Green cells (o/vas-enhanced green fluorescent protein) represent germ cells. Meiotic germ cells at 
zygotene (B) and pachytene [(C), dotted line] stages, and spermatid-like cells (D). Red dots in germ cells represent OLVAS protein. (E) Detection of protamine 
(purple signals) in foxl3~^^ XX gonad at 10 days after hatching. Scale bars, 10 |j,m. 
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Fig. 3. Progression of spermatogenesis in a female gonadal environ- 
ment. (A) RT-PCR analysis. Ov, ovary: Ts, testis. (B and C) Cross sections 
of adult gonads with immunohistochemistry. (B) Wild-type ovary. (Inset) A 
germinal cradle, in which a germline stem cell (arrow), cystic germ cells 
(yellow dotted line), and a diplotene oocyte (arrowhead) are surrounded by 
sox9b-expressing cells (red). (C) foxl3~^^ XX gonad. (Inset) Expanded ger- 
minal epithelium in which sox9b -expressing cells (red) surround germ cells 
(green). The yellow arrowhead indicates an oocyte. Scale bars, 50 |im. (D to 
G) Cross sections of adult gonads subjected to periodic acid-Schiff staining. 
(D) Wild-type ovary. (Inset) The germinal cradle hptween basement mem- 



brane (red arrowheads) and epithelial cells (black arrowhead) of the ger- 
minal epithelium (yellow dotted lines). (E) foxIS^''^ XX gonad. Spermatogenesis 
progresses within the expanded germinal epithelium (yellow dotted lines) 
that resides between the basement membrane (bottom inset, red arrow- 
heads) and epithelial cells (top inset, black arrowheads). The arrow indicates 
an oocyte in the stromal compartment. Asterisks in (B to E) indicate ovarian 
cavities. [(F) and (G)] Spermatogenesis proceeds as units of cysts [(F), red 
dotted lines] from distal (d) to proximal (p) position in wild-type testis (F), 
but this arrangement is disorganized in the foxl3~^^ XX gonad (G). Scale 
bars, 10 |im. 



Fig. 4. foxl3~^~ XX gonads produce func- 
tional sperm. (A) Fertilization rate of 
artificial insemination by using sperm from 
wild-type testes (n = 3) and foxl3~^~ ovaries 
(n = 8). In each artificial insemination, one 
gonad was used. (B) Hatching rate of the 
fertilized eggs in (A). (C) Unfertilized eggs. 
(D) Eggs fertilized by sperm derived from 
foxl3~^~ ovaries. The activated egg mem- 
brane is shown in (□’). (E) Embryos hatched 
from the eggs in (D). Statistics by two-tailed 
Student’s t test; < 0.05. 




of foxl3. Instead of oocytes, spermatid-like cells 
were present at the periphery of gonads in both 
mutant lines (NA17 in Fig. 2, A and D, and FHA8 
in fig. S4H); these cells expressed protamine, a 
sperm-specific gene (Fig. 2E and fig. S5). In ad- 
dition, other markers for spermatogenesis such 
as kifl7, tektin-t, and shippol were also expressed 
in foxl3~^~ XX gonads (fig. S5). At around the same 
stage in normal XY gonads, only type I germ cells 
are present (fig. S2J), and spermatogenesis does 
not begin until puberty, which occurs later than 
1 month after hatching. This indicates that in 
foxl3~^~ XX mutants, spermatogenesis begins much 
earlier than puberty in wild-type males. 



Chimeric analysis revealed that foxl3~^~ germ 
cells completed spermatogenesis in wild-type XX 
gonads, indicating that the phenotype is due to 
loss of foxl3 function in germ cells but is not 
affected by gonadal somatic cells (fig. S6). Fur- 
thermore, introduction of a bacterial artificial 
chromosome (BAG) containing^< 2 ?^ allele restored 
the formation of oocytes in foxl3~^~ XX larvae (fig. 
S7). Collectively, these data indicate that foxl3 is a 
germline-intrinsic factor involved in sperm-egg 
fate decision in medaka. 

To characterize the gonadal structures pro- 
ducing sperm in the XX mutants, we performed 
in situ hybridization and reverse transcription 



polymerase chain reaction (RT-PCR) using sex- 
specific markers. Throughout gonadal develop- 
ment from 10 to 30 days after hatching and in 
adult gonads of foxl3~^~ XX fish, the female 
markers and aromatase were expressed in 
somatic cells surrounding spermatogenic cells, 
whereas the male markers dmrtl and p45011l3 
were absent, reflecting the normal female ex- 
pression pattern in somatic cells (Fig. 3A and 
figs. S8, A to F, and S9, A to F). In adults, the 
secondary sex characteristics, which are distin- 
guished by the morphology of dorsal/anal fins 
and urogenital papilla (13), were of the female 
type in the XX mutants (fig. SIO, A and B). Con- 
sistent with this, the foxl3~'~ XX gonads exhibited 
morphologically ovarian structures characterized 
by an ovarian cavity on the dorsal side and a 
stromal compartment on the ventral side (Fig. 3, 
B and C, and fig. SIO, C and D). In wild-type ova- 
ries, a thin multilayered tissue called the germi- 
nal epithelium separates the ovarian cavity from 
the stromal compartment (fig. SllA). In addition, 
the germinal cradle, a unit consisting of sox9b- 
expressing cells that harbor germline stem cells 
and support early oogenesis (Fig. 3B), lies be- 
tween the epithelial cells of the germinal epithe- 
lium and the basement membrane bordering 
the stromal compartment (Fig. 3D and fig. SUB) 
(14). \nfoxl3~'~ XX gonads, spermatogenesis pro- 
ceeded within the expanded germinal epithelium 
(Fig. 3E and fig. Sll). Early stages of spermatogenic 
cells were inclined to localize on the basal side of 
the expanded germinal epithelium underlain by 
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the basement membrane and were surrounded by 
so^P5-expressing cells, forming a structure similar 
to the germinal cradles (Fig. 3C and figs. Sll and 
S12). A subset of these germ cells expressed nanos2, 
a marker of germline stem cells (14), suggesting 
that germline stem cells were established in the 
absence of foxlS fimction (fig. S9, G to I). 

In wild-type testes, spermatogenesis proceeds 
synchronously in cysts composed of Sertoli cells 
surrounding spermatogenic cells at the same 
stage. The cysts are arranged along a distal-to- 
proximal axis, so that the spermatogonia are 
located most distally and matured spermatids 
are released into the efferent duct in the most 
proximal position (Fig. 3F). In the expanded ger- 
minal epithelium of foxl3~^~ XX gonads, how- 
ever, the distal-to-proximal arrangement of the 
cysts was disorganized, so that spermatogenic 
cells at various stages were adjacent to each 
other (Fig. 3G). This observation suggests that 
feminized somatic cells are not able to arrange 
masculinized germ cell cysts in the proper distal- 
to-proximal manner, as Sertoli cells would other- 
wise do in wild-type testes. At the inner side of 
the expanded germinal epithelium, meiotic sper- 
matogenic cells did not always contact sox9b- 
expressing cells (fig. S12). Therefore, it is possible 
that spermatocytes in foxl3~'~ XX gonads could 
differentiate autonomously into sperm within the 
expanded germinal epithelium, as previously re- 
ported occurring in culture (i5). Occasionally, 
oocytes appeared in between spermatogenic cells 
in the expanded germinal epithelium (Fig. 3C), 
which subsequently exited as follicles into the 
stromal compartment (Fig. 3E), as seen in follicle 
formations of the wild-type ovary (fig. Sll) (14). 
Collectively, the expanded germinal epithelium in 
foxl3~'~ XX gonads harbors germline stem cells 
and is able to support follicle formation but can- 
not organize the cysts in a distal-proximal direc- 
tion. Any morphological trait of efferent ducts 
was not observed, possibly leading to the large 
accumulation of spermatogenic cells within the 
germinal epithelium of foxl3~'~ XX gonads. We 
conclude that foxl3~^~ XX mutants undergo sper- 
matogenesis in histologically functional ovaries. 

On the other hand, foxl3~'~ XY adult gonads 
exhibited morphologically normal testes with 



functional sperm (fig. SIO, E to H, L, and M), 
indicating that foxlS is dispensable for male 
development hi addition, chimeric analysis revealed 
that foxl3~'~ germ cells did not initiate preco- 
cious spermatogenesis in wild-type male gonads 
(fig. S6, B and D). These results imply that male, but 
not female, somatic cells are able to properly reg- 
ulate spermatogenesis in the absence of foxlS. The 
absence of male somatic cell regulation may allow 
foxl3~^~ germ cells to complete spermatogenesis 
much earlier than puberty in wild-type males. 

To investigate whether the sperm produced by 
foxl3~^~ XX mutants were functional, we per- 
formed artificial insemination. The gonads from 
2- or 3-month-old XX mutants were minced with 
forceps to suspend sperm in medaka Ringer’s 
solution, and ovulated eggs of wild-type females 
were subsequently inseminated by the resultant 
sperm. Approximately 50% of the eggs were suc- 
cessfully fertilized (Fig. 4A), as confirmed by the 
activation of the egg membrane (Fig. 4, C and D). 
The fertilized eggs exhibited normal development, 
and more than 95% of them hatched (Fig. 4, B and 
E). Therefore, the foxl3~^~ XX mutant produced 
functional sperm. Furthermore, foxl3~^~ XX mu- 
tants spontaneously spawned a few fertile eggs, 
although their fertility— as assessed by the number 
of spawned eggs, the fertilization rate, and the 
hatching rate— was lower than that of the foxl3^'~ 
mutants (fig. SIO, I to K). 

The presence of eggs in foxl3~^~ XX mutants 
suggests the existence of a /oa?/3-independent 
pathway controlling oocyte formation. The oocytes 
did not appear until around 20 to 30 days after 
hatching (fig. S8, G to I) in the mutants, a stage at 
which an ovarian cavity starts to form because of 
the action of estradiol (E2) (id). However, inhibi- 
tion of aromatase activity and E2 signaling by the 
administration of fadrozole (FAD) and tamoxifen 
(TAM), respectively, did not block oocyte formation 
in the mutants (fig. S13). On the other hand, ad- 
ministration of E2 during the embryonic and larval 
stages of both XX and XY foxl3~^~ mutants did not 
induce the precocious oocyte formation observed 
in E2-treated foxl3^'~ XY larva (fig. S14). These 
findings suggest that after derepression of sperma- 
togenesis upon loss of foxlS expression, oocytes are 
formed in an E2-independent manner. 



Our findings suggest that suppression of sperm- 
atogenesis by foxlS is an important component of 
female fate decision in germ cells. In this respect, 
we note that another gene of forkhead box tran- 
scriptional factor, foxl2, is involved in the mainte- 
nance of female fate in the mammalian supporting 
cells by suppressing the male fate (8). 
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Stockholm in the second week of December is a special place. The city is alive 
with excitement as it welcomes and celebrates the new Nobel Laureates at the 
annual Nobel Prize ceremony. 

If you are a PhD student, you could be here too - meeting a Nobel Laureate and 
receiving a rather special prize yourself. 

The journal Science & SciLifeLab have established The Science & SciLifeLab Prize for 
Young Scientists, to recognize and reward excellence in PhD research and support young 
scientists at the start of their careers. Its about bright minds, bright ideas and bright futures. 

Four winners will be selected for this international award. They will have their essays published 
in the journal Science and share a new total of 60,000 USD in prize money. The winners 
will be awarded in Stockholm, in December, and take part in a unique week of events 
including meeting leading scientists in their fields. 

'‘The last couple of days have been exhilarating. It has been an experience of a lifetime. 
Stockholm is a wonderful city and the Award winning ceremony exceeds my wildest dreams. ” 
—Dr. Dan Dominissini, 2014 Prize Winner 

Who knows. The Science & SciLifeLab Prize for Young Scientists could be a major step- 
ping stone in your career and hopefully one day, during Nobel week, you could be visiting 
Stockholm in December once again. 




This prize is made possible with the 
kind support of the Knut and Alice 
Wallenberg Foundation. This Founda- 
tion grants funding in two main areas; 
research projects of high scientific poten- 
tial and individual support of excellent 
scientists. 



The 2015 Prize is now open. The deadline for submissions is August 1, 2015. 

Enter today: www.sciencemag.org/scilifelabprize 

The 2015 Prize categories are: 

• Cell and Molecular Biology 

• Ecology and Environment 

• Genomics and Proteomics 

• Translational Medicine 
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The new Spark 10M multimode micro- 
plate reader is designed to offer greater 
flexibility and increased productivity for 
cell biology and genomics customers. 
From microbiology research and cell- 
based assays to rapid DNA quantification, 
the Spark 1 0M delivers a combination of 
advanced capabilities and exceptional 
ease-of-use to simplify your daily work. 

At the heart of the Spark 1 0M are Tecan’s 
unique Fusion Optics which offer users 
the choice of filter- or monchromator- 
based measurements— or even a combi- 
nation of both— at the touch of a button, 
meaning laboratories no longer have to 
make a trade-off between flexibility and 
sensitivity. This cutting-edge system is 
complemented by Fligh Speed Mono- 
chromators, which provides a complete 
absorbance scan, from 200 nm to 1 ,000 
nm, in less than 5 seconds. The Spark 
1 0M reader has been developed from the 
outset with cell-based assays in mind, 
and includes a host of software and hard- 
ware features designed to simplify cell 
biology protocols. 

Tecan 
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Automated Sample Storage 

The Verso is a truly modular automated 
storage platform that is easily configured 
to meet the needs of the most demand- 
ing sample management applications. 

The system can be scaled to fit sample 
capacities ranging from 100,000 to over 
five million tubes at temperatures from 
ambient to -20°C. Verso is capable of 
storing and processing up to 1 ,500 tubes 
and plates per hour, and allows loading 
and unloading of up to 70 sample racks 

at a time. For a fully automated workflow. Verso can integrate with 
Flamilton automated liquid handling workstations. With end-users in 
mind, the intuitive system allows users to easily perform most jobs 
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Fluorescence Illumination System 

The new Lumen 100-LED illuminator 
provides advanced, high-quality illumi- 
nation for a wide range of fluorescence 
microscopy applications. With a 25,000- 
hour lifetime, the Lumen 100-LED offers 
a broad array of LED’s covering wave- 
lengths throughout the visible spectrum 
and is suitable for the majority of fluoro- 
phores used in fluorescence applications. 
A wide range of adapters is available to 
connect the Lumen 100-LED to most up- 
right and inverted microscopes currently 
on the market. Directly coupled to the 
fluorescence port to maximize light ef- 
ficiency, the unit can be further enhanced 
by the addition of excitation filters to LEDs 
to further optimize the bandwidth for your 
specific application. Engineered to com- 
pletely eliminate power to the LED when 
set to zero light level, the Lumen 1 00 is 
ideally suited for electrophysiology and 
optogenetics as well as general fluores- 
cence imaging applications. A combiner 
is available if two LEDs need to be used 
simultaneously. 
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Western Blot Validated Antibodies 

PrecisionAb Antibodies are a catalog 
of rigorously tested primary antibodies 
validated for detection of endogenous 
proteins via Western blot. Scientists 
have struggled with antibody unreliability 
for decades because manufacturing 
standards and quality control vary 
widely among vendors, targets, and 
even among antibody lots. Each 
PrecisionAb antibody is screened on 
whole cell lysates from up to 12 different 
biologically relevant cell lines and only 
antibodies that detect endogenous 
proteins with high sensitivity and 
specificity were chosen for the catalog. 
Each antibody can be purchased as a 
trial size and comes with positive control 
lysates for easy in-lab validation to fit 
researchers’ needs. 

Bio-Rad 
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Confocal Raman Imaging Module 

The newest SWIFT v2 ultrafast confocal 
Raman imaging module is available 
for the XploRA PLUS and LabRAM 
Evolution. The new SWIFT v2 module 
greatly enhances the ability to obtain fast 
and detailed confocal Raman images 
with the click of a button. The module 
offers a 4x to 5x improvement in speed 
on even the fastest of Raman images, 
while maintaining the class leading 
confocal performance and sensitivity. 

It makes Raman imaging a realistic 
alternative to optical imaging techniques 
and can be used to survey 3-D sample 
structure and surface features. Its 
technology and speed will enable the 
user to focus on detail and the chemical 
information within an image, and is 
not constrained by any compromise in sensitivity or confocal 
performance, which has previously challenged Raman imaging. The 
new module offers a powerful yet cost effective route to employing 
the full power of Raman imaging in the laboratory. 
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For info: 732-494-8660 
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Coiiege of 
ihteoimhic 

FACULTY POSITION: 
BIOCHEMISTRY/GENETICS 

New York Institute of Teehnology-College of 
Osteopathie Medieine (NYIT-COM) seeks an 
Assistant Professor with teaehing expertise 
in bioehemistry/geneties in the Department of 
Biomedieal Seienees. The area of researeh foeus 
is open. The Department eurrently has aetive 
researeh in eardiovaseular disease/heart failure, 
renal physiology and development, neuroseienee, 
and mierobiology. Sueeessful eandidates are 
expeeted to eontribute to the medieal sehool 
teaehing effort and partieipate in an aetive 
researeh program. Collaboration with existing 
faeulty researeh aetivities is strongly eneouraged. 
The medieal sehool provides strong institutional 
support for both teaehing and researeh aetivities, 
ineluding maintenanee of eore faeilities, support 
for student workers, and aeeess to IT resourees. 

The sueeessful eandidate will possess a Ph.D., 
D.O., M.D., or D.V.M. and 2+ years of post- 
doetoral researeh experienee and demonstrated 
sueeessful medieal edueation/teaehing experienee. 
We offer institutionally-supported faeulty salaries, 
an extremely eompetitive benefits paekage, and 
a professional environment designed to enhanee 
eareer development. To apply, please e-mail eover 
letter and resume to: Dr. A. Martin Gerdes, 
Chair of Biomedical Sciences at agerdes(^nyit. 
edu . NYIT-COM, PO Box 8000 Northern Blvd, 
Old Westbury, NY 11568-8000. 

EOEM/F/D/V. 
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A POSTDOCTORAL POSITION is available 
at the University of Maryland School of Medicine to 
model inherited lipid storage diseases using iPSC 
technology. The candidate should have a Ph.D., ex- 
perience in Cell/Molecular Biology, excellent oral and 
written communication skills, and be able to work in- 
dependently. Experience in hESC/iPSC technology 
and hematopoietic/neuronal development is desirable. 
Salary is commensurate with experience. To apply, send 
curriculum vitae and contact information for three 
references to Dr. Ricardo A. Feldman at e-mail: 
rfeldman@som.umaryland.edu. 
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PLAY A LEAKING R«LE IN 
THE FIGHT AGAINST CANCER 

Director of Population and Early Detection Research 
Competitive + excellent benefits | London 

At Cancer Research UK, our goal is to bring forward the day when all cancers are cured. 

As Director of Population and Early Detection Research, you'll help us realise that vision 
by leading our research strategy for one of the key things in beating the disease: 
early detection b diagnosis. 

Based in our London head office, you'll drive and implement strategy, generate excitement in 
the research community, and help build our capacity and expertise - enjoying autonomy and 
flexibility, as well the right resources, environment and freedom to flourish in your career. 

To find out more and apply, visit the jobs section in cancerresearchuk.org 

Cancer Research UK is a registered charity in England and Wales (1089464), Scotland (SC041666) 

and the Isle of Man (1103). Registered company in England and Wales (4325234) and the Isle of Man (5713F). 

CANCER 

Let's beat cancer sooner. RESEARCH 

cruk.org 
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Institut Pasteur 


FOR OUR FUTURE 



The Institut Pasteur is hiring 
group leaders for its new Center of 
Bioinformatics, Biostatistics and 
Integrative Biology (C3BI) 

The new direction of the Institut Pasteur has defined 
Bioinformatics, Biostatistics and Integrative Biology as strategic 
priorities. A new center (C3BI) to foster research in these domains 
was set up in 2014. Substantial resources were allocated for the 
creation and development of the C3BI, with the recruitment of 
40 research engineers in bioinformatics and biostatistics between 
2014 and 2017. A building will be renovated on the Paris campus 
to house the C3BI. 

In this context, the Institut Pasteur is looking for several new 
senior and/or junior group leaders in bioinformatics, biostatistics 
and integrative biology, with a strong methodological component. 
The main focus is on computational and statistical analysis of 
biological "big data", typically produced by new generation 
sequencing and -omics technologies, but all modeling and 
computational approaches of biological questions closely 
connected with Institut Pasteur research areas are eligible. Highly 
attractive packages to match the experience of the candidate will 
be provided, including institutional salaries (principal investigator, 
permanent scientists, secretary and postdoctoral fellows), a 
substantial contribution to running costs and equipment, as well 
as support for relocation expenses and administrative issues. 



Candidate's profile 

Successful candidates will possess the following qualifications: 

• PhD with a minimum of 6 years of research experience 

• Recognized scientific leadership in bioinformatics/ 
biostatistics 

• Broad experience in methodological development for the 
analysis of various types of data 

• Consistent publishing record of cutting edge research as senior/ 
first author 

• Significant experience in managing and mentoring scientists, 
and in designing, financing and executing an innovative 
research program 

• Demonstrated ability to collaborate with experimental and 
computational biologists 

Application web site (forms and documents to be filled and 
uploaded): 

https://c3bi.pasteur.fr/research-teams-apply/ 

Deadline: September 20, 2015 midnight CEST 

Further information on the institute and C3BI can be found at 

http://www.pasteur.fr and https://c3bi.pasteur.fr/ 

Short-listed candidates will be invited for interview in October 2015, 
and results will be announced by mid-December. The new research 
groups will start by early 2016 (the precise date is negotiable). 

Informal inquiries can be addressed to Olivier Gascuel 
(Head of C3BI - C3BICall2015@pasteur.fr) 
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Postdoc Careers 

* August 28, 2015 

Reserve space by August 11 

THERE’S A SCIENCE TO REACHING SCIENTISTS. 






Sciences ‘ii For recruitment in science, there’s only one ScionCC 







This feature helps postdocs transition from one discipline or 
department to another. Topics include strategies for making such 
changes and discussions about the advantages/disadvantages of 
changing directions. 



What makes Science the best choice for recruiting? 



Post online and your 
job will be e-mailed 
to over 13,000 job 
seekers looking for 
postdoc positions. 



■ Read and respected by 570,400 readers around the globe 

■ 78% of readers read Science more often than any other journal 

■ Your ad sits on specially labeled pages to draw attention to the ad 

■ Your ad dollars support AAAS and its programs, which strengthens 
the global scientific community. 

Why choose this postdoc feature for your advertisement? 

■ Relevant ads in the career section with special postdoc banner 

■ 67% of our weekly readers are Ph.D.s. 



Expand your exposure. Post your print ad online to beneh't from: 

■ Link on the job board homepage directly to postdoc opportunities 

■ Dedicated landing page for postdoc opportunities. 



SCIENCECAREERS.ORG 
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To book your ad: advertise@sciencecareers.org 



The Americas Europe/RoW 

202-326-6582 -f44(0) 1223-326500 



Japan China/Korea/Singapore/Taiwan 

-F81-3-3219-5777 -h86-186-0082-9345 




y WLshingtnn Univ'ersity in Stlmiis 

School of Med[cd^e 

Faculty Positions in Biochemistry and 
Molecular Biophysics 

The Department of Bioehemistry and Moleeular Biophysies at Washington 
University Sehool of Medieine invites applieations for several tenured or 
tenure-traek faeulty positions at the level of Assistant, Assoeiate or Full 
Professor. Sueeessful eandidates will have established a strong reeord of 
researeh. Applieants seeking tenured positions must have a strong reeord of 
external funding. 

Outstanding individuals working in any area of bioehemistry and moleeular 
biophysies are eneouraged to apply. The eandidate’s researeh should be 
aimed at addressing fundamental questions related to moleeular meehanisms 
of biologieal or biomedieal relevanee. Current researeh in the department 
spans a wide range of topies ineluding eomputational biology, membrane 
proteins, moleeular motors, nueleie aeid / protein interaetions, protein 
strueture, enzymology and signal transduetion. Additional information about 
the department is available at http://www.bioehem.wustl.edu. Washington 
University has a highly interaetive researeh environment with vigorous 
interdiseiplinary graduate and medieal seientist training programs. Minority 
and women seientists are espeeially eneouraged to apply. 

Applieants should email their eurrieulum vitae and a brief deseription of their 
researeh interests to the Seareh Committee at bmbsearch@biochem.wustl.edu. 
Applicants should include contact information for three individuals who can 
write letters of recommendation. The committee will request letters as necessary. 

Completed applications will be reviewed on a rolling basis, starting 
immediately. For full consideration, applications should be received by 
December 1, 2015. 

Washington University is an Equal Opportunity Employer. We are committed 
to the recruitment of candidates traditionally underrepresented on university 
faculties. Individuals of any race, ethnicity, gender or sexual orientation 
are encouraged to apply, as are disabled individuals and veterans. The 
School of Medicine at Washington University is committed to finding 
solutions to global health problems, including ones that affect minority and 
disadvantaged populations. 
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INDIANA UNIVERSITY 

IlLOOMlNr.TON 

Scientist at the Bioomington Drosophiia Stock Center 

The Bloomington Drosophila Stoek Center at Indiana University Bloomington 
(http://flystoeks.bio.indiana.edu/) is seeking a seientist to join its management 
team. The BDSC is a large, dynamie and heavily used repository fox Drosophila 
melanogaster strains that serves researehers worldwide. lUB provides an 
outstanding intelleetual environment for BDSC aetivities, with more than a 
dozen researeh groups using Drosophila for studies in moleeular and eell biology, 
development, neurobiology, physiology, evolution and eeology. The work of 
BDSC seientists is wide ranging and ineludes management of eolleetion eontents 
(strain aeeessions, deaeeessions, quality eontrol), information management 
(bioeuration, database eonstruetion and use, website development), seientifie 
eommunieation (user support, grant proposals and reporting, eonferenee 
presentations, position papers, manuseripts), and business operations (personnel, 
user aeeounts, billing, finaneial reeordkeeping and reporting). The new BDSC 
seientist will partieipate in all aspeets of BDSC operations and eventually 
assume a primary leadership role in the organization. Candidates should have 
extensive knowledge of Drosophila geneties, exeellent eommunieation skills, 
proven grantsmanship, well-developed organizational abilities, interest in a long- 
term position, and a keen desire to support the Drosophila researeh eommunity. 

Applieants must hold a Ph.D. degree and have relevant postdoetoral experienee 
with a solid reeord of researeh aeeomplishments. Experieneed seientists are 
eneouraged to apply. All applieations reeeived by September 1, 2015 will be 
eonsidered. Applieants should submit a eover letter, CV and the names of three 
referenees to http://indiana.peopleadmin.com/postings/1603. Please address 
inquiries to Jeremy Bennett at (812) 855-6283; jebennet@indiana.edu; 1001 
E. Third St., Bloomington, IN 47405-7005. 

Indiana University is an Equal Employment and Affirmative Action Employer 
and a provider of ADA services. All qualified applicants will receive 
consideration for employment without regard to age, ethnicity, color, race, 
religion, sex, sexual orientation or identity, national origin, disability status 
or protected veteran status. 










Center for Immunology and 
Microbial Disease 
Albany Medical College 
Faculty Position 

The Center for Immunology & Microbial Disease at Albany Medical 
College invites applications for a tenure-track, junior or senior faculty 
position from individuals who have a doctoral degree, postdoctoral 
experience, and demonstrated research productivity. Those with an 
interest in host-pathogen interactions are particularly encouraged to apply. 
The successful candidate will be expected to establish an independent, 
extramurally-funded research program and participate in the teaching of 
medical and graduate students. The basic science departments at Albany 
Medical College are organized as interdisciplinary research centers and 
the Center for Immunology & Microbial Disease has a focus on microbial 
pathogenesis and immune defense, particularly as related to biothreat 
agents and emerging infections. The new faculty recruit will receive a 
competitive salary, an attractive start-up package, laboratory space in our 
newly constructed research building, and access to all departmental core 
services including the Center’s fully-staffed Immunology and ABSL-3/ 
BSL-3 Cores. Albany Medical College is located in a mid-sized city within 
the upstate New York Capital Region, and has easy access to Boston, New 
York City, and the Adirondack Mountains. 

Applicants should send their curriculum vitae, a statement of research 
plans, and three letters of reference to: 

Faculty Search Committee 

Center for Immunology & Microbial Disease 
Albany Medical College 
47 New Scotland Avenue, MC-151 
Albany, NY 12208 

For further information about the Center, visit: 

www.amc.edu/Academic/Research/imd.htm 

An Equal Opportunity /Affirmative Action Employer. 

Women andminorities are encouraged to apply. 






Scientific Director 
For the Center for Predictive Medicine for 
Biodefense and Emerging Infectious Diseases 

The University of Louisville seeks applieations and nominations for the 
position of Seientifie Direetor of its Center for Predietive Medieine for 
Biodefense and Emerging Infeetious Diseases (CPM). The Seientifie Direetor 
will work elosely with the team responsible for managing the NIH-affiliated 
Regional Bioeontainment Laboratory (BSL-2/ABSL-2,BSL-3/ABSL-3) at the 
University of Louisville and will be expeeted to lead researeh and edueational 
aetivities that eontribute towards meeting its operational and finaneial 
requirements. The Direetor will report to the Exeeutive Viee President for 
Researeh and Innovation and will be appointed to the Faeulty of the aeademie 
eollege and Department that is appropriate to their edueation and expertise. 

The ideal applieant will have a PhD, MD, or equivalent degree, a proven 
publieation reeord and an established researeh program with history of 
funding support. Experienee and/or expertise in a regulated environment is 
preferable. Applieants should review information at the CPM website http:// 
louisville.edu/predietivemedieine. Applieants should submit a CV, the names 
and eontaet information of four seientifie referenees, and a letter deseribing 
the applieant’s experienee in leading a multidiseiplinary researeh program 
that may eneompass eollaborations with individuals, aeademie institutions, 
government, industry, eontraet labs, and/or foundations. In addition, a 
statement that outlines plans for leading the CPM as its Seientifie Direetor 
should be provided. 

Applieations should be reeeived by September 15, 2015 for full eonsideration, 
although applieations will eontinue to be reviewed until the position is 
filled. Nominations and applieations will remain eonfidential until invited 
for eampus interview. 

All applieations and requested material must be submitted on-line at: https:// 
highereddeeisions.eom/uofl/eurrent vaeaneies.asp Job ID UL374. Use the 
left margin option. View Vacancy Announcement to read the ad, and/or the 
Applicant Guide to apply. 

The University of Louisville is an Affirmative UNIVERSITY OF 
Action, Equal Opportunity, Americans with | jf\|||C\/|| I p 
Disabilities Employer. U U T 1 t 
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WORKING LIFE 



By Vijaysree Venkatraman 



The space roboticist 

T he first motorized vehicle that Vandi Verma ever operated was a tractor. “I must’ve been 
11 years old at the time,” she told Science. During school vacations, she visited her grandpar- 
ents, who lived in a village in central India. At their farm, her uncle let her take a few turns 
behind the tractor wheel. Later, when she was a teenager, her father, who was a pilot with 
the Indian Air Force, taught her how to drive a car. That was unusual in India at that time, 
where those who could afford a car hired a driver. 



Today, Verma is one of the few peo- 
ple in the world who is qualified to 
drive a vehicle on Mars. 

Verma majored in electrical en- 
gineering in India and came to the 
United States to study artificial intel- 
ligence. She was captivated by the 
landing of the Sojourner Mars rover 
in 1997 and decided to apply her en- 
gineering skill to space exploration. 

She pursued a Ph.D. in robotics at 
Carnegie Mellon University and did 
internships with NASA’s Ames Re- 
search Center. She also got her first 
taste of robotic exploration here on 
Earth by field testing a rover that 
surveyed South America’s Atacama 
Desert for signs of life. 

After graduating, in 2005, Verma 
joined the intelligent systems divi- 
sion at Ames Research Center as a 
research scientist. Later, she moved 
to NASA’s Jet Propulsion Lab (JPL), the command center 
for the Mars rover missions. There, her robotics expertise 
and experience with field testing rovers won her a chance 
to drive the Opportunity rover. Verma drove Opportunity for 
3 years before graduating to the nuclear-powered Curiosity 
rover, which is now prowling Mars, examining its rocks to see 
whether it is, or was ever, a suitable habitat for life. 

Each day, before the rover shuts down for the frigid mar- 
tian night, it calls home, Verma says. Besides relaying scien- 
tific data and images it gathered during the day, it sends its 
precise coordinates. They are downloaded into simulation 
software Verma helped write. The software helps drivers plan 
the rover’s route for the next day, simulating tricky maneu- 
vers. Operators may even perform a dry run with a duplicate 
rover on a sandy replica of the planet’s surface in JPL’s Mars 
Yard. Then the full day’s itinerary is beamed to the rover so 
that it can set off purposefully each dawn. 

For the first 3 months after landing, from 5 August to 
5 November 2012, the team was working on Mars time. “I 
loved that we didn’t have to wait long after we uplinked our 



commands to see the results from 
Curiosity, and every sol (a martian 
day), we were doing something we’d 
never done before,” Verma says. 

Curiosity can scoop dirt, drill rock, 
and hand off samples to the onboard 
lab. While a sample is being ana- 
l 5 ^ed. Curiosity is already on its way 
to the next site. Verma helped write 
the code that lets Curiosity juggle 
these tasks. ‘We have to drive on 
to find newer things for the slew of 
instruments to analj^e without com- 
promising the rover hardware or the 
sample,” she says. 

She loves her day-to-day respon- 
sibility for the machine. “You defi- 
nitely don’t want to be the one who 
drove the rover off a cliff! But I find 
it energizing rather than stressful. 
You’re completely focused.” 

But she has not left research be- 
hind. One of Verma’s key research goals has been to give 
rovers greater autonomy to decide on a course of action. She 
is now working on a software upgrade that will let Curiosity 
be true to its name. It will allow the rover to autonomously 
select interesting rocks, stopping in the middle of a long 
drive to take high-resolution images or analyze a rock with 
its laser, without any prompting from Earth. 

Originally designed for a 2-year mission. Curiosity is still 
going strong and has already made many scientifically signif- 
icant finds. ‘With every drive, we get to explore new terrain 
that no human has seen in this kind of detail,” Verma says. 

Although human spacefaring has stalled, Verma says the 
spirit of exploration is alive and well in space robots. “I am 
happy to be working in robotics, pushing the envelope on 
space exploration,” she says. “We have reached Mars, our 
neighboring planet. We have only just begun.” ■ 



Vijaysree Venkatraman is a Boston-based science journal- 
ist. For more on life and careers, visit sciencecareers.org. 
Send your story to SciCareerEditor@aaas.org. 




‘7 am happy to be ... pushing 
the envelope on space 
exploration.” 



338 17 JULY 2015 • VOL 349 ISSUE 6245 



sciencemag.org SCIENCE 



Published by AAAS 





